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An expression vector for expressing a protein or polypeptide in a bacterium, which comprises a first DNA sequence encod- 
ing at least a secretion signal of a lipoprotein, and a second DNA sequence encoding a protein or fragment thereof, or polypep- 
tide or peptide heterologous to the bacterium which expresses the protein or fragment thereof, or polypeptide or peptide. The bac- 
terium expresses a fusion protein of a lipoprotein or lipoprotein segment and the protein or fragment thereof, or polypeptide or 
peptide heterologous to the bacterium which expresses the protein or fragment thereof, or polypeptide or peptide. Such expres- 
sion vectors increase the immunogenicity of the protein or fragment thereof, or polypeptide or peptide by enabling the protein or 
fragment thereof, or polypeptide or peptide to be expressed on the surface of the bacterium. Bacteria which may be transformed 
with the expression vector include mycobacteria such as BCG. The expression vectors of the present invention may be employed 
in the formation of live bacterial vaccines against Lyme disease wherein the bacteria express a surface protein of Borrelia burg- 
dorferi^ the causative agent of Lyme disease. 
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BACTERIAL EXPRESSION VECSORB 
CONTAINING DNA ENCODING SECRETION SIGNALS 
OF LIPOPROTEINS 

This application is a continuatioaa-in-part of application 
Serial No. 780, 261, filed October 21, 1991. 

This invention relates to expression vectors for expressing 
a protein in a bacterium, such as for example, a mycobacterium. 
More particularly, this invention relates to expression vectors 
for expressing and secreting proteins which are heterologous to 
the bacterium which expresses such proteins wherein such vectors 
further include DNA encoding at least the secretion signals of 
lipoproteins designed to achieve lipid acylation and surface 
expression of heterologous proteins. 

Certain mycobacteria represent major pathogens of man and 
animals. For example, tuberculosis is generally caused in humans 
by Mycoba cterium tuberculosis , and in cattle by Mycobacterium 
bpvis, which may also be transmitted to himans and other animals. 
Mycobac teria leprae is the causative agent of leprosy. 
M. tuberc ulosis and mycobacteria of the avium-intracellulare- 
scrofulaceum group (MAIS group) represent major opportunistic 
pathogens of patients with acquired immune deficiency syndrome 
(AIDS). M . pseudotuberculosis is a major pathogen of cattle. 

On the other hand, Baqille Calraette-Guerin, or BOG, an 
avirulent strain of M.bovis , is widely used in human vaccines, 
and in particular is used as a live vaccine, which is protective 
against tuberculosis. BCG is the only childhood vaccine which is 
currently given at birth, has a very low incidence of adverse 
effects, and can be used repeatedly in an individual. (eg., in 
multiple forms). In addition, BCG and other mycobacteria (eg., 
M. s megm atls ) , employed in vaccines, have adjuvant properties 
among the best currently known and, therefore, stimulate a 
recipient's immune system to respond to antigens with great 
effectiveness. 
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It has been suggested by Jacobs, et. ai. Nature, Vol. 327, 
Mo 6122, pgs. 532-535 (June 11, 1987), that BCG could be used as 
a host for -the construction of recombinant vaccines. In other 
words, it vas suggested to take an existing vaccine (in this case 
against tuberculosis) and expand its protective repertoire 
through the introduction of one or more genes from other 
pathogens. 

Transformation, the process whereby naked DNA is introduced 
into bacterial cells, has been carried out successfully xn 
mycobacteria. Jacobs, et al (1987), as hereinabove cited, have 
described transformation of mycobacteria by electroporation. 
Electroportation can give from 10^ to 10^ transformants per m of 
plasmid DNA and such plasmid DNA' s may carry genes for resistance 
to antibiotic markers such as kanamycin. Snapper, et al, PNAS, 
Vol. 85, pgs. 6987-6991 (September, 1988); to allow for selection 
of transformed cells from non-transformed cells. 

Jacobs, et al (1987) and Snapper, et al (1988) have also 
described the use of cloning vehicles such as plasmids and 
bacteriophages, for carrying genes of interest into mycobacteria. 

Le^, et al., PNAS. Vol. 88, pgs. 3111-3115 (April 1991), 
describe vectors which employ DNA encoding a mycobacterial phage 
int^grase and phage attachment site to effect site-specific 
integration into a mycobacterial chromosome. Such vectors permit 
stable integration of vectorss encoding foreign antigen genes 
into a mycobacterial chromosome. 

Stover, et al., (Nature , Vol. 351, pgs. 456-460 (June 6, 
1991)) describe Integrative and extrachromosomal vectors 
employing mycobacterial HSP60 and HSP70 promoters to express 
foreign antigens cytoplasmically in recombinant BCG. Stover, et 
al. demonstrated that recombinant BCG expressing foreign antigens 
with these vectors could be used as immunogens to generate 
humoral and cellular immune responses to the foreign antigens. 
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Combination of the above-mentioned techniques, along with 
standard tools of molecular cloning (e.g,^ use of restriction 
enzymes, etc.) allows the cloning of genes of interest into 
vectors and introduction of such genes into mycobacteria. 

In accordance with an aspect of the present invention^ there 
is provided an expression vector for expressing a protein or 
polypeptide or peptide in a bacterium. The expression vector 
comprises a first DNA sequence encoding at least a secretion 
signal of a lipoprotein; and preferably further comprises a 
second DNA sequence encoding a protein or fragment thereof or 
polypeptide or peptide heterologous to the bacterium which 
expresses the protein or fragment thereof, or polypeptide or 
peptide, whereby the bacterium expresses a fusion protein of a 
lipoprotein or lipoprotein segment (which may include the 
secretion signal), and the protein or fragment thereof, or 
polypeptide or peptide heterologous to the bacterium which 
expresses the protein or polypeptide or peptide. 

Such an expression vector may be employed in any of a 
variety of bacteria which may be employed in vaccines, including 
live vaccines. In particular, in one embodiment, the bacterium 
is iR mycobactrium such as, but not limited to, Mycobac terium 
bpvis - BCG, M, sm eq matis . M. avium , M.phlei , M> f ortuitium , M. l ufu. 
M. par atuberculosis , M.habana , M. scrof alaceum , M. int race llul ar e, 
and M . vaccae . 

In one embodiment, the mycobacterium is M,bovls *BCG. 

Although the scope of the present invention is not to be 
limited to any theoretical reasoning, it is believed that the 
signal sequence of the lipoprotein enables the expressed 
recombinant fusion protein to be modified such that the protein 
is expressed at the surface of the bacterium as a chimeric 
lipoprotein. For example, the fusion protein may include 
processing or recognition site(s) for signal peptidase II in the 
signal sequence portion, which enables lipid acylation of the 
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fusion protein- Such lipid acylation of the fusion protein may 
enhance the inununogenicity of the heterologous protein or 
fragment thereof, or polypeptide or peptide portion of the fusion 
protein. Also, the signal sequence enables the fusion protein to 
be expressed at and anchored to the surface of the bacterium, 
thus making the heterologous protein or polypeptide more 
accessible, which also may increase the immunogenicity of the 
protein or fragment thereof, or polypeptide or peptide. Also, 
because such fusion proteins may be ejcpressed on the surface of 
the bacterium, such expression or secretion of the fusion protein 
will permit the expression of antigens which may be lethal if 
expressed or maintained cytoplasmically in the bacterium. It is 
to be understood that the heterologous protein or fragment 
tliereof, or polypeptide or peptide may itself be a lipoprotein, 
such as the OspA antigen of Borrelia burgdorferi, which is 
hereinafter discussed, or a non-lipoprotein, such as, for 
example, HIV antigens, tetanus toxoids, diphtheria toxoids, 
cholera toxoids, pertussis toxoids, and malarial antigens. Thus, 
the expression vectors of the present invention enable the 
genetic engineering of a non-lipoprotein moiety which may become 
anchored to the surface of a bacterium. 

Thus, the expression vectors enable the expression of 
heterologous genes or gene segments (which originally encoded 
non- lipoproteins) as chimeric surface lipoproteins. This is 
accomplished by gene fusion of the foreign genes or gene segments 
to vector encoded genes or gene segments encoding lipoproteins or 
lipoprotein signal peptides, respectively. 

In one embodiment, the first DNA se<^ence encodes at least a 
secretion signal of a mycobacterial lipoprotein. The 
mycobacterial lipoprotein may, in one embodiment, be an 
M. tuberc ulosis lipoprotein. The M. tuber culosis lipoprotein may 
be selected from the group consisting of the H. tuberculosis 19 
kda antigen and the M. tuberculosis 38 kda antigen. 
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Other lipoproteins, of which at least the secretion signal 
may be encoded by the first DNA sequence include, but are not 
limited to^ Braun's lipoprotein of E. coli . S, marcescens ^ E, 
amylosora, M. morqanii ^ and P, mirabilis , the TraT protein of E. 
coli and Salmonella ; the penicillinase (PenP) protein of B, 
1 i cheni f o rmi s and B, cereus and S. aureus ; pullulanase proteins 
of Kleb si el la pneumoni ae and Klebsiella ae r oqene se ; E. coli 
lipoproteins lpp-28. Pal, RplA, RplB, OsmB, NlpB, and Orll7; 
chitobiase protein of V. harsevi ; the P-1 , 4-endoglucanase protein 
of Pseudo monas solanacearum , the Pal and Pep proteins of 
influenzae; the OprI protein of P. aeruginosa ; the MalX and AmiA 
proteins of S. pneumoniae ; the 34 kda antigen and TpmA protein of 
Treponem a pallidu m; the P3 7 protein of Mycoplasma hvorhini s ; and 
the 17 kda antigen of Rickettsia rickettsii . It is to be 
understood, however, that the scope of the present invention is 
not to be limited to secretion signals of any particular 
lipoprotein or lipoproteins. 

In one embodiment, the first DNA sequence may further 
include DNA which encodes all or a portion of the lipoprotein. 
Thus, in such an embodiment, the fusion protein which is 
expressed by the bacterium is a fusion protein of the secretion 
signal of the lipoprotein, all or a portion of the lipoprotein, 
and the heterologous protein or polypeptide or peptide. 

The first and second DNA sequences are under the control of 
a suitable promoter. In one embodiment, the promoter may be the 
19 kda antigen promoter or the 38 kda antigen promoter of 
M. tube rculosis if DNA encoding the secretion signal of one of 
these antigens is employed. Alternatively, the promoter may be a 
mycobacterial promoter other than the 19 kda and 38 kda 
M. tube r culosus antigen promoters, or a mycobacteriophage 
promoter . 

Mycobacterial and mycobacteriophage promoters which may be 
employed include, but are not limited to, mycobacterial promoters 
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such as the BCG HSP60 and HSP70 promoters; the mycobactin 
promoter from m i-nberculosis and BCG; the mycobacterial 14 kda 
and 12 kda antigen promoters; the mycobacterial a-antigen 
promoter from" M. tuberculosis or BCG; the MBP-70 promoter, the 
mycobacterial 45 kda antigen promoter from H^tuberculosis or BCG; 
the superoxide dismutase promoter; the mycobacterial asd 
promoter, and mycobacteriophage promoters such as the Bxbl, Bxb2, 
Bxb3, LI, L5, D29 and TM4 promoters. In one embodiment, the 
promoter is a mycobacterial heat shock protein promoter such as 
HSP60 or HSP70. 

Example of expression vectors Including the mycobacterial 
promoters and mycobacteriorphage promoters hereinabove described 
are further described in application Serial No. 642,017, filed 
January 16, 1991, which is a continuation of application Serial 
No. 552,828, filed July 16, 1990, now abandoned. The contents of 
application Serial No. 642,017 are hereby incorporated by 
reference . 

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon, which 
provides for the initiation of the translation of mRNA, are each 
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the heterologous 
protein, and the transcription termination site, may be of 
mycobacterial origin, or of other bacterial origin, or may be 
synthetic in nature, or such Stop codon and transcription 
termination site may be those of the DNA encoding the 
heterologous protein or polypeptide. 

Preferably, the mycobacterial promoter is a BCG promoter, 
and the mycobacterium is BCG. 

Heterologous proteins or polypeptides which may be encoded 
by the second DNA sequence include, but are not limited to, 
antigens, anti-taimor agents, enzymes, lymphokines, pharmacologic 
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agents, immunopotentiators, and reporter molecules of interest in 
a diagnostic context. 

Antigens which may be encoded include, but are not limited 
to, My cobacterium leprae antigens; Mycobacterium tuberculosis 
antigens; Rickettsia antigens; Chlamydia antigens; Coxiella 
antigens; malaria sporozoite and merozoite proteins, such as the 
circumsporozoite protein from P 1 a smodium be r ghe i sporozoites; 
diphtheria toxoids; tetanus toxoids; Clostridium antigens; 
Leishma nia antigens; Salmonella antigens; E . coli antigens; 
Listeria antigens; Borrelia antigens, including the OspA and OspB 
antigens of Borrelia burgdorferi ; Franciscella antigens; Yersini a 
antigens; Mycobacterium af ricanum antigens; M y c ob ac t e r i um 
intracell ulare antigens; Mycobacterium avium antigens; Trepo nema 
antigens; Schistosome antigens; Filaria antigens; Pertussis 
antigens; Staphylococcus antigens; Herpes virus antigens; 
influenza and parainfluenza virus antigens; measles virus 
antigens; Bordatella antigens; Hemophilus antigens; Streptococcus 
antigens, including the M protein of S. pyogenes and pneumococcus 
antigens such as Streptococcus pneumoniae antigens; mumps virus 
antigens; hepatitis virus antigens; Shigella antigens; Neisseri a 
antigens; rabies antigens; polio virus antigens; Rift Valley 
Fever virus antigens; dengue virus antigens; measles virus 
antigens; rotavirus antigens; Human Immunodeficiency. Virus (HIV) 
antigens, including the gag, pol, and env proteins; respiratory 
syncytial virus (RSV) antigens; snake venom antigens; human tumor 
antigens; and Vibrio cholera antigens. Enzymes which may be 
encoded include, but are not limited to, steroid enzymes. 

In one embodiment, the second DNA sequence encodes at least 
one protein or polypeptide or fragment or derivative thereof 
which includes an epitope which is recognized by cytotoxic T 
lymphocytes induced by an HIV protein or fragment or derivative 
thereof. The at least one DNA sequence may encode an HIV protein 
or fragment or derivative thereof. HIV proteins or polypeptides 
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which may be encoded by the at least one DNA sequence includes 
but are not limited to, HIV-I-gp 120; HlV-I-gp 41; HIV-I-gp 160; 
HIV-I-pol; HIV-I-nef; HIV-I-tat; HIV-I-rev; HIV-I-vif; HIV-I-vpr; 
HIV-I-vpu; HIV-I-gag; HIV-2gp 120; HIV-2-gp 160; HIV-2-gp 41; 
HIV-2-gag; HIV-2-pol? HIV-2-iief; HIV-2-tat; HIV-2-rev; HIV-2-vif; 
HIV-2-vpr; HIV-2-vpu; and HIV-2-vpx. 

Anti-tumor agents which may be encoded include, but are not 
limited to, interferon- a , interferon- P, or interferon- , and 
tumor necrosis factor, or TNF, Lymphokines which may be encoded 
include, but are not limited to, inter leukins 1 through 8. 

It is also contemplated that the heterologous protein or 
polypeptide may be a reporter molecule or selectable marker. 

Reporter molecules which may be encoded include, but are not 
limited to, lucif erase, B-galactosidase, B- glucuronidase, and 
catechol dehydrogenase. 

Other peptides or proteins which may be encoded include, but 
are not limited to, those which encode for stress proteins, which 
can be administered to evoke an immune response or to induce 
tolerance in an autoimmune disease (e.g., rheumatoid arthritis). 

Selectable markers which may be encoded include, but are not 
limited to, the p-galactosidase marker, the kanamycin resistance 
marker, the chloroamphenicol resistance marker, the neomycin 
resistance marker, and the hygromycin resistance marker, 
bacteriophage resistance markers, or genes encoding enzymes 
involved in the synthesis of nutritional elements, such as amino 
acids . 

In accordance with one embodiment, the vector further 
includes a mycobacterial origin of replication. 

In accordance with another embodiment, the vector may be a 
plasmid. The plasmid may be a non- shuttle plasmid, or may be a 
shuttle plasmid which further includes a bacterial origin of 
replication such as an E.coli origin of replication, a Bacillus 
origin of replication, a Staphylococcus origin of replication, a 
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Streptomyces origin of replication, or a streptococcal origin of 
replication. In one embodiment, the shuttle plasmid includes an 
E, coli origin of replication , 

In accordance with yet another embodiment, the vector may 
further include a multiple cloning site, and the second DNA 
sequence encoding for the heterologous protein is inserted in the 
multiple cloning site. 

In another embodiment, the expression vector may be, for 
example, a temperate shuttle phasmid or a bacterial-mycobacterial 
shuttle plasmid. Each of these vectors may be used to introduce 
the first DNA sequence encoding at least the secretion signal of 
a lipoprotein and a second DNA sequence encoding a protein or 
fragment thereof, or polypeptide or peptide heterologous to the 
mycobacterium which expresses the protein or fragment thereof, or 
polypeptide or peptide stably into mycobacteria, in which the DNA 
seqeunces may be expressed. When a shuttle phasmid, which 
replicates as a plasmid in bacteria and a phage in mycobacteria, 
is employed, integration of the phasmid, which includes the first 
DNA sequence encoding at least the secretion signal of a 
lipoprotein, and a second DNA sequence endoing a protein or 
fragment thereof, or polypeptide or peptide heterologous to the 
mycobacterium which expresses the protein or fragment thereof, or 
polypeptide or peptide, into the mycobacterial chromosome, occurs 
through site-specific integration. The DNA seqeunces are 
replicated as part of the chromosomal DNA. When a 
bacterial-mycobacterial shuttle plasmid is employed, the DNA 
sequences are stably maintained extrachormosomally in a plasmid. 
Expression of the DNA sequences occur extrachromosomally (e.g., 
episomally). For example, the DNA sequences are cloned into a 
shuttle plasmid and the plasmid is introduced into a 
mycobacterium such as those hereinabove described, wherein the 
plasmid replicates episomally. Examples of such shuttle phasmids 
and bacterial-mycobacterial shuttle plasmids are further 
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desciribed in Application Serial No. 361,944, filed June 5, 1989, 
which is hereby incorporated by reference. 

In addition to the first DNA sequence encoding at least the 
secretion signal of a lipoprotein and the second DNA sequence 
encoding a heterlogous protein or fragment thereof, or 
polypeptide or peptide, and the mycobacterial promoter for 
controlling expression of the DNA encoding the heterologous 
protein or polypeptide, the expression vector may, in one 
embodiment, further include a DNA sequence encoding bacteriophage 
integration into a raycobacterium chromosome. Bacteriophages from 
which the DNA sequence encoding bacteriophage integration into a 
mycobacterium chromosome may be derived include, but are not 
limited to, mycobacteriophages such as but not limited to, the 
L5, 1,1, Bxbl, and TM4 mycobacteriophages; the lambda phage of 
co.li; the toxin phages of Corvnebacteria ; phages of Actinomycetes 
and Norcardia; the #C31 phage of Streptomvces; and the P22 phage 
of Sa lmonella . Preferably, the DNA sequence encodes 
mycobacteriophage integration into a mycobacterium chromosome. 
The DNA sequence which encodes bacteriophage integration into a 
mycobacterium chromosome may include DNA which encodes integrase, 
which is a protein that provides for integration of the vector 
into the mycobacterial chromosome. Preferably, the DNA sequence 
encoding mycobacterial phage integration also includes DNA which 
encodes an attP site. 

The DNA encoding the attP site and the integrase provides 
for an integration event which is referred to as site-specific 
integration. DNA containing the attP site and the integrase gene 
is capable of integrating into a corresponding attB site of a 
mycobacterium chromosome . 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria . 
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Examples of DNA which is a phage DNA portion encoding 
bacteriophage integration into a mycobacterium chromosome are 
further described in Application Serial No. 869,330, filed April 
15, 1992, which is a continuation-in-part of Application Serial 
No. 553,907, filed July 16, 1990, now abandoned. The contents of 
Application Serial No. 869,330 are incorporated by reference. 

The vectors of the present invention may be employed to 
transform bacteria, and in particular, mycobacteria which 
include, but are not limited to, Mycobacterium bovia - BCG, M. 
emeqmatis, M. avium . M. phlei . M. fortuitum . M. lufu . m. 
paratuberculosis . M. habana . M. scrof alaceum . M. intracellulare 

and M;^ vaccae; in particular, such vectors may be employed to 

transform BCG. The transformed mycobacteria thus express the 
heterologous protein, which, as hereinabove stated, may be an 
antigen, which induces an immune response, or a therapeutic 
agent. Thus, the transformed mycobacteria may be employed as 
part of a pharmaceutical composition, such as a vaccine and/or 
therapeutic agent, which includes the transformed mycobacteria, 
and an acceptable pharmaceutical carrier. Acceptable 
pharmaceutical carriers include, but are not limited, to mineral 
oil, aliun, synthetic polymers, etc. Vehicles for vaccines and 
therapeutic agents are well known in the art and the selection of 
a suitable vehicle is deemed to be within the scope of those 
skilled in the art from the teachings contained herein. The 
selection of a suitable vehicle is also dependent upon the manner 
in which the vaccine or therapeutic agent is to be administered. 
The vaccine or therapeutic agent may be in the form of an 
injectable dose and may be administered intramuscularly. 
Intravenously, orally, intradermal ly, or by sxibcutaneous 
administration. 

The mycobacteria are administered in an effective amount. 
In general, the mycobacteria are administered in an amount of 
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from about 1 x 10^ to about 1 x 10^° colony forming units (CFU's) 
per dose. 

Other -means for administering the vaccine or therapeutic 
agent should be apparent to those skilled in the art from the 
teachings herein; accordingly, the scope of the invention is not 
to be limited to a particular delivery form. 

As hereinabove noted, the expression vectors of the present 
invention may contain DNA which encodes Borrelia antigen(s), 
including but not limited to surface proteins or antigens of 
Ror-t-eli a burgdorferi , the causative agent of Lyme disease. Thus, 
in accordance with an aspect of the present invention, there is 
provided a method of protecting an animal against Lyme disease 
which comprises administering to an animal mycobacteria 
transformed with DNA which includes at least one DNA sequence 
which encodes a protein or polypeptide which elicits antibodies 
against Borrelia burgdorferi . The mycobacteria are administered 
in an amount effective to protect an animal against Lyme disease. 
Such amounts may be those hereinabove described. In one 
embodiment, the at least one DNA sequence encodes a surface 
protein of Borrelia burgdorferi or a fragment or derivative 
thereof. Surface proteins of Borrelia buroedorferi which may be 
encoded by the at least one DNA sequence, include but are not 
limited to. Outer Surface Protein A and Outer Surface Protein B, 
sometimes hereafter referred to as OspA and OspB, respectively . 

The transformed mycobacteria include those hereinabove 
described. In one embodiment, the mycobacteria are of the species 
M. bo vis- BCG. 

The at least one DNA sequence which encodes a protein or 
polypeptide which elicits antibodies against Borrelia 
burgdorferi, in a preferred embodiment, is contained in a 
mycobacterial expression vector. In one embodiment, the 
mycobacterial expression vector may include a DNA sequence 
encoding at least a secretion signal of a lipoprotein, such as 
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those hereinabove described, and wherein the mycobacterium 
expresses a chimeric fusion protein of the lipoprotein or 
lipoprotein segment (which may include the secretion signal) and 
the protein or polypeptide which elicits antibodies against 
Borr e lia burgdorferi . Such an expression vector enables the 
protein or polypeptide which elicits antibodies against Borrelia 
burgd orferi , to be expressed on the surface of the mycobacterium, 
whereby the protein or polypeptide becomes more accessible. 

It is also contemplated that, in another embodiment, the 
mycobacterial expression vector may contain DNA which encodes all 
or a portion of a mycobacterial excretion protein, as well as the 
DNA which encodes a protein or polypeptide which elicits 
antibodies against Borrelia burgdorferi . The mycobacterium 
expresses a fusion protein of the mycobacterial excretion protein 
or a portion thereof, and the protein or polypeptide which 
elicits antibodies against Borrelia burgdorferi , Such an 
expression vector enables the protein or polypeptide to be 
excreted from the mycobacterium. Examples of mycobacterial 
excretion proteins which may be encoded, include, but are not 
limited to, the a -anti gen of M. tuberculosis and BCG. 

The mycobacterial expression vector, in one embodiment, may 
include a promoter selected from the group consisting of 
mycobacterial promoters and mycobacteriophage promoters, such as 
those hereinabove described, and/or may include a DNA sequence 
encoding bacteriophage integration into a mycobacterium 
chromosome, also as hereinabove described. 

In another embodiment, the mycobacterial expression vector 
may be a plasmid, such as a non-shuttle plasmid or a shuttle 
plasmid which further includes a bacterial origin of replication, 
also as hereinabove described. 

It is also contemplated that the mycobacterial expression 
vector may be a temperate shuttle phasmid or a 
bacterial-mycobacterial shuttle plasmid as hereinabove described. 
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The transformed mycobacteria are employed as part of 
composition for protecting an animal against Lyme disease. Such 
a composition includes the transformed mycobacteria, and an 
acceptable pKararaaceutical carrier such as those hereinabove 
described. 

The invention will now be further described with respect to 
the following examples; however, the scope of the present 
invention is not intended to be limited thereby. 

Example 1 

A. rnnatfuetion of plasmids includin g mycobacterial promoter 
expression cassette. 
1. Construction of PYUB125 

Plasmid pALSOQO, a plasmid which contains an origin of 
replication of M. fortuitiim , and described in Labidi , et al . , 
FEMS M icrobiol. Lett ., Vol. 30, pgs. 221-225 (1985) and in Gene, 
Vol. 71, pgs. 315-321 C1988), is subjected to a partial Sau 3A 
digest, and 5kb fragments are gel purified. A 5kb fragment is 
then ligated to Bam HI digested pIJ665 (an. — coli vector 
containing an E. coli origin of replication and also carries 
neomycin-kanamycin resistance, as described in Kieser, et al.. 
Gene , Vol. 65, pgs. 83-91 (1988) to form plasmid pYUB12. A 
schematic of the formation of plasmid pYUB12 . A schematic of the 
formation of plasmid pYUB12 is shown in Figure 1. pyUB12 and 
pIJ666 were then transformed into M. smeqmatis and BCG. 
Neomycin-resistant transformants that were only obtained by 
pYUB12 transformation confirmed that pALSOOO conferred autonomous 
replication to pIJ666 in M. smeamatis and BCG. 

Shotgun mutagenesis by Snapper, et al (1988, hereinabove 
cited) indicated that no more than half of the pALSOOO plasmid 
was necessary to support plasmid replication in BCG. This 
segment presumably carried open reading frames ORFl and 0RF2, 
identified by Rauzier, et al.. Gene , Vol. 71, pgs. 315-321 
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(1988), and also presumably carried a mycobacterial origin of 
replication. pYUB12 is then digested with Hpal and EcoRV, a 2S86 
bp carrying this region or segment pALBOOO is removed and ligated 
to PvuII digested pYUBB. Plasmid pYUBS (a pBR322 derivative) 
includes an E> coli replicon and a kan (aph) gene. Ligation of 
the 2586 bp pYUB12 fragment to PvuII digested pYUBS results in 
the formation of pYUB53, as depicted in Figure 2. Transformation 
of pYUB53 confirmed that the EcoRV-Hpal fragment, designated 
M. rep, was capable of supporting autonomous replication in BCG. 

Plasmid pYUB53 was then digested with AatI, EcoRV, and PstI 
in order to remove the following restriction sites: 

AatI 5707 

EcoRI 5783 

BamHI 5791 

Sail 5797 

PstI 5803 

PstI 7252 

Sail 7258 

BamHI 7264 

EcoRI 7273 

Clal 7298 

Hindi I I 7304; and 

EcoRV 7460 

Fragment ends are then flushed with T4 DNA polymerase and 
religated to form plasmid pyUB125, construction of which is shown 
in Figure 3 . 

2 . Elimination of superfluous vector DNA from pYUB12 5 

792 bases of the tet gene, which had been inactivated by 

prior manipulations, was eliminated by a complete Narl digest, 

gel purification of the 6407 bp fragment, and 

ligation/recirculation, transformation of E. coli strain HBlOl, 

R 

and selection of Kan transf ormants . The construction of 
resulting plasmid, pMVlOl, is schematically indicated in Figure 
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4, and the DNA sequence of pMVlOl, which includes markings of 
regions which will be deleted, and of mutations, as hereinafter 
described, is shown in Figure 5. 

r^, Klimination of und esirable restriction sites in aph 

(kan^y gene. 

To facilitate future manipulations, the Hindi II and Clal 
restriction sites in the aph gene were mutagenized simultaneously 
by polymerase chain reaction (PGR) mutagenesis according to the 
procedure described in Gene, Vol. 77 pgs. 57-59 (1989). The 
bases changed in the aph gene were at the third position of 
codons (wobble bases) within each restriction site and the base 
substitutions made were designed not to change the amino acid 
sequence of th& encoded protein. 

Separate PGR reactions of plasmid pMVXQl with primers 
ClaMut-Kan + HindRMut-Kan and HindFMut-Kan + Barn-Kan were 
performed at 94«C (1 min.), 50«»C (1 mtn. ) , and 72«'C (1 min. ) for 
25 cycles. The PGR primers had the following base sequences: 

ClaMut-Kan 

CTT GTA TGG GAA GCC CC 
Hi ndRMut-Kan 

GTG AGA ATG GGA AAA GAT TAT GCA TTT GTT TGG AG 
H indFMut-Kan 

GTG TGG AAA GAA ATG GAT AAT GTT TTG GGA TTG TGA GCG G 
B am- Kan 

CGT AGA GGA TGG AGA GGA GG 
The resulting PGR products were gel purified and mixed and a 
single PGR reaction without primers was performed at 94°G (1 
min.), 72°C (1 min.) for 10 cycles. Primers GlaMut-Kan and 
Barn-Kan were added and PGR was resumed at 94 *G (1 min.), 50<*G (1 
min.), and 72 °C (2 min.) for 20 cylces. The resulting PGR 
product (Kan. mut) was digested with BamHI and gel purified. 
Plasmid pMVlOl was digested with Clal and cohesive ends were 
filled in by Klenow + dGTP + dGTP. Klenow was heat inactivated 
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and the digest was further digested with BamHI . The 5232 base 

pair fragment was gel purified and mixed with fragment Kan.mut 

and ligated. The ligation was transformed into E. coli strain 
R 

HBlOl and Kan colonies were screened for plasmids resistant to 
Clal and Hindlll digestion. Such plasmids were designated as 
pMVllO, which is depicted in Figure 4. 

4. Elimination of sequences not necessary for plasmid 

replication in mycobacteria, 

Plasmid pMVllO was resected in separate constructions to 
yield plasmids pMVlll and pMV112- In one construction, pMVllO 
was digested with Narl and Ball, the ends were filled in, and a 
5296 base pair fragment was ligated and recircularized to form 
pMVIll. In another construct, pMVllO was digested with Ndel and 
SplI, the ends were filled in, and a 5763 base pair fragment was 
ligated and recircularized to form pMV112 . SGhematics of the 
constructions of pMVlll and pMV112 are shown in Figure 6. These 
constructions further eliminated superfluous E. coli vector 
sequences derived from pAL5000 not necessary for mycobacterial 
replication. Cloning was performed in E. coli . Plasmids pMVlll 
and pMV112 were tested for the ability to replicate in M. 
sraegmati^s. Because both plasmids replicated in M. smegmatis the 
deletions of each plasmid were combined to construct pMV113. 
(Figure 6). 

To construct pMV113, pMVlll was digested with BamHI and 
EcoRI, and a 1071 bp fragment was isolated. pMVll2 was digested 
with BamHI and EcoRI, and a 3570 bp fragment was isolated, and 
then ligated to the 1071 bp fragment obtained from pMVlll to form 
pMV113, These constructions thus defined the region of pALSOOO 
necessary for autonomous replication in mycobacteria as no larger 
than 1910 base paris. 

5. Mutagenesis of restriction sites in mycobacterial 

repllcon. 
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To facilitate further manipulations of the mycobacterial 
replicon, PGR mutagenesis was performed as above to eliminate the 
Sal I, EcoRI, and Bglll sites located in the open reading frame 
known' as ORFl of pALSOOO. PGR mutagenesis was performed at 
wobble bases within each restriction site and the base 
substitutions were designed not to change the amino acid sequence 
of the putative encoded ORFl protein. The restriction sites were 
eliminated one at a time for testing in mycobacteria. It was 
possible to eliminate the Sail and EcoRI without altering 
replication in M. smeomatis . In one construction PGR mutagenesis 
was performed at EcoRIlOTl of pMV113 with primers Eco Mut - M.rep 
and Bam-M.rep to form pMV117. which lacks the EcoRIlOTl site. 
Primer Eco Mut - M.rep has the following sequence: 
TCC GTG CAA CGA GTG TCC CGG A; 
and Bam-M.rep has the following sequence: 
CAC CCG TCC TGT GGA TCC TGT AC. 
In another construction, PGR mutagenesis was performed at 
the Sail 1389 site with primer Sal Mut - M.rep and Bam-M.rep to 
form PMV119, which lacks the Sail 1389 site. Primer Sal Mut- 
M.rep has the following sequence: 

TGG CGA CGG GAG TTA GTG AGG GGT. 
pMVll? was then digested with ApaLI and Bglll, and a 3360 bp 
fragment was isolated. pMV119 was digested with ApaLI and Bglll, 
and a 1281 bp fragment was isolated and ligated to the 3360 bp 
fragment isolated from pMVllT to form pMV123. A schematic of the 
constructions of plasmids pMVll?, pMVll9, and pMV123 is shown in 
Figure 7. Elimination of the Bglll site, however, either by PGR 
mutagenesis or Klenow fill in, eliminated plasmid replication in 
mycobacteria, thus suggesting that the Bglll site is in proximity 
to, or within a sequence necessary for mycobacteria plasmid 

replication. 

6. Construction of pMV200 series v ectors. 
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To facilitate manipulations of all the components necessary 

for plasmid replication in E. coli and mycobacteria, (E. rep. and 

M. rep.) and selection of recombinants (Kan ), cassettes of each 

component were constructed for simplified assembly in future 

vectrs and to include a multiple cloning site (MCS) containing 

unique restriction sites and transcription and translation 

terminators. The cassettes were constructed to allow directional 

cloning and assembly into a plasmid where all transcription is 

unidirectional . 
R 

Kan Cassette 

A DNA cassette containing the aph (Kan ) gene was 

5 3 

constructed by PGR with primers Kan ' and Kan ' . An Spel site 
was added to the 5* end of the PGR primer Kan3 ' , resulting in the 
formation of a PGR primer having the following sequence: 
GTC GAG TAG TGA GGT GTG CGT GGT GAA G. 

Bam HI + Nhel sites were added to the 5' end of the primer 
Kan5 ' , resulting in the formation of a PGR primer having the 
following sequence : 

GAG AGO ATC CTT AGC TAG CCA CT GAG GTC GGG G. 

PGR was performed at bases 3375 and 4585 of pMV123, and 
BamHI and Nhel sites were added at base 3159, and an Spel site 
was added at base 4585. Digestion with BamHI and Spel, followed 
by purification resulted in a 1228/2443 Kan cassette bounded by 
BamHI and Spel cohesive ends with the direction of transcription 
for the aph gene proceeding from BamHI to Spe I . 

E , rep , cassette 

A DNA cassette containing the GolEI replicon of pUC19 was 
constructed by PGR with primers E.rep/Spe and E.rep/Mlu, An Spel 
site was added to the 5' end of PGR primer E.rep/Spe and an Mlul 
site was added to the 5' end of PGR primer E.rep,/Mlu, The 
resulting primers had the following sequences: 

E . rep. /Spe 

GGA GTA GTT CCA GTG AGG GTC AGA GGG 
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GAC AAC GCG TTG CGC TCG GTC GTT CGG CTG. 

PGR was perforffled at bases 713 and 1500 of pUC19, and an 
Mlul site was- added to base 713, and a Spel site was added to 
base 1500- Digestion with Mlul and Spel, followed by 
purification resulted in an E.rep. cassette bounded by Spel and 
Mlul cohesive ends with the direction of transcription for RNA I 
and RNA II replication primers proceeding from Spel to Mlul. 

M . rep . cas sette 

A DNA cassette containing sequences necessary for plasmid 
replication in mycobacteria was constructed by PGR of pMV123 with 
primers M.rep/Mlu and M. rep/Bam. An Mlul site was added to the 
5' end of PGR primer M. rep/Mlu. A BamHI site was added to the 5' 
end of PGR primer M/rep/Bam. The resulting PGR primers had the 
following base sequences: 

M . rep. /Mlu 

CCA TAG GCG TGA GCG GAG GAG GTC CG 
M. rep. /Bam 

GAG GCG TCG TGT GGA TCG TCT AC 

PGR was performed at bases 134 and 2082 of pMV123 . An Mlul 
sited was added to base 2082. Digestion with BamHI and Mlul, 
followed by gel purification resulted in a. 1935 base pair DNA 
cassette bounded by Mlul and BamHI cohesive ends with the 
direction of transcription for the pALSOOQ ORFl and 0RF2 genes 
proceeding from Mlul to Bam HI . 

The Kan^, E.rep, and M.rep PGR cassettes were then mixed in 
equimolar concentrations and ligated, and then transformed in 
coii strain HBlOl for selection of Kan^ transf ormants . Colonies 
were screened for the presence of plasmids carrying all three 
cassettes after digestion with BamHI + Mlul + Spel and designated 
pMV200. An additional restriction site, Ncol, was eliminated 
from the M.rep cassette by digestion of pMV200 with Ncol, fill in- 
with Klenow, and ligation and recircularization, resulting in the 
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formation of pMV201. A schematic of the formation of pMV200 from 
pMV123 and pUC19, and of pMV201 from pMV200, is shown in Figure 
8. Plasmids pMV200 and pMV201 were transformed into M. smegmatis 
and BCG. Both plasmids yielded Kan transf orraants^ thus 
indicating their ability to replicate in mycobacteria, 

A synthetic multiple cloning sequence (MCS) (Figure 9) was 
then designed and synthesized to facilitate versatile molecular 
cloning and manipulations for foreign gene expressions in 
mycobacteria, and for integration into the mycobacterial 
chromosome. The synthetic MCS, shown in Figure 9, contains 16 
restriction sites unique to pMV201 and includes a region carrying 
translation stop codons in each of three reading frames, and a Tl 
transcription terminator derived from E. coli rrnAB ribosomal RNA 
ope r on. 

To insert the MCS cassette, pMV201 was digested with Narl 
and Mhel, and the resulting fragment was gel purified. The MCS 
was digested with HinPI and Nhel and, the resulting fragment was 
gel purified. The two fragments were then ligated to yield 
pMV204. A schematic of the construction of pMV204 is shown in 
Figure 10. 

Flasmid pMV204 was then further manipulated to facilitate 
removal of the M.rep cassette in further constructions. pMV204 
was digested with Mlul, and an Mlul - Not I linker was inserted 
into the Mlul site between the M^rep and the E.rep to generate 
pMV206. A schematic of the construction of pMV206 from pMV204 is 
shown in Figure 11, and the DNA sequence of pMV206 is given in 
Figure 12. 

7- Insertion of BCG HSP60 promoter sequence . 

The published sequence of the BCG HSP60 gene (Thole, et al . , 
Infe c t, and Immun, , Vol. 55, pgs. 1466-1475 {June 1987)), and 
surrounding sequence permitted the construction of an HSP60 
promoter fragment by PCR. The 251 bp HSP50 promoter fragment 
(Figure 13, and as published by Stover, et al. (1991)) was 
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amplified by PGR with primers including added Xbal and Nhel 
sites The PGR HSP60 fragment is then digested with Xbal and 
Nhel/and ds ligated into Xbal digested pMV206 to form pRB26 

(Figure 14)- , , 

8 T..o.-t:inn of ^^^^ ^n.odina th. 1 9 kda M tuberculosxs 

Qsx>A gene into mycob.ct^ri.al expression 

vector- 

^The sequence of the 19 kda M tuberculosis gene is given m 
Ashbridge, et al., NnrlPir ftctds Research, Vol. 17, pg. 1249 
(1989) The 19 kda antigen gene ribosomal binding site, start 
codon, and signal sequence from M. tuberculosis chromosomal DNA 
were amplified by PCR with nucleotide primers. The resulting 153 
bp fragment (Figure 15) obtained by PGR includes added Bglll (5 ) 
and BamHIr EcoRI sites (3'). This fragment contains the entire 
5' region of the 19 kda gene up to the 27th codon with the 
exception of the promoter sequence. The PGR fragment is digested 
with Bglll and EcoRI and ligated into BamHI-EcoRI digested pRB26 
to form p2619S (Figure 16). 

The gene encoding the OspA antigen is described in 
Bergstrom, et al.. Molecular Microbiology, Vol. 3, No. 4, pgs. 
479-486 (1989). The OspA gene sequence, excluding only the 
N-terminal 18 codons (encoding the secretion signal) was derived 
by PCR with added BamHI (5") and Sail (3') sites to provide a 780 
bp OspA fragment. p2619S was digested with BamHI and Sail, and 
the 780bp PCR OspA fragment was digested with BamHI and Sail to 
generate cohesive ends and ligated to BamHI and Sail digested 
p2619S to form p2619: :OspA. (Figure 17). 

Example 2 
fTnns-bructlon of m^vcob acterlal vector 
including promoter and DN A encoding signal 
sequence of 19 kda V . tuberculosis antigen 



wo 93/07897 



23 



PCr/US92/09075 



Plasmid pMV206 was constructed as described in Example 1. 
The 19 kda M. tuberculosis antigen gene promoter/ ribosomal 
binding site, start codon, and secretion signal was amplified by 
PGR with nucleotide primers- The PGR fragment includes added 
Xbal and BamHI sites. This sequence, shown in Figure 18, which is 
286 bp in length, includes the entire published 5' region of the 
19 kda gene up to the 27th codon* The PGR fragment was then 
digested with Xbal and BamHI, and ligated into Xbal and BamHI 
digested pMV206 to form pl9PS (Figure 19). The 780 bp OspA PGR 
cassette, as described in Example 1, was digested with BamHI and 
Sail, and ligated to BamHI and Sail digested pl9PS to form 
pl9PS: rOspA, 

Example 3 

Construction of mycobacterial expression 
vector with M. tuberculosis 38 kda antigen 
promoter and signal secpjience and OspA gene 
The gene sequence for the M. tuberculosis 38 kda antigen is 
given in Andersen, et al.. Infection and Immunity , Vol. 57, No. 
8, pgs. 2481-2488 (Aug. 1989). A DNA sequence encoding the 38 
kda antigen promoter, ribosomal binding site, start codon, and 
secretion signal, obtained from M. tuberculosis chromosomal DMA, 
and containing the entire 5' sequence up to the 45th codon, was 
amplified by PGR with nucleotide primers. The resulting PGR 
fragment includes added Xbal and BamHI sites. The PGR fragment, 
297 bp in length, and shown in Figure 20, was digested with Xbal 
and BamHI, and ligated into Xbal and BamHI digested pMV206 to 
form p38PS (Figure 21). The 780 bp OspA PGR cassette, as 
hereinabove described in Examples 1 and 2, is digested with BamHI 
and Sail and ligated into BamHI and Sail digested p38PS to form 
p38PS : :OspA. 



Example 4 
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n»n«-fi-»gfcioii of myeobacterial expression 
>.r«>fT-h»n wt-fch exp i-ession cassette based on 
BCG HSP60 and OspA gene 
PM7206 was constructed as hereinabove described in Example 

^ The published sequence of the BCG HSP60 gene (Thole, et al, 
T»F.. t-,. and immun. . Vol. 55, pgs. 1466-1475 (June 1987)), and 
surrounding sequence permitted the construction of a cassette 
carrying expression control sequences (i.e., promoter, ribosomal 
binding site, and translation initiation sequences as published 
in Stover, et al. (1991)) by PCR. The BCG HSP61 cassette (Figure 
22) contains 375 bases 5' to the BCG HSP60 start codon, and 15 
bases (5 codons) 3' to the start codon. PCR oligonucleotide 
primers were then synthesized. Primer Xba-HSP60, of the 
following sequence: 

CAG ATC TAG ACG GTG ACC ACA ACG CGC C 
was synthesized for the 5' end of the cassette, and primer 
Bam-HSP61, of the following sequence: 

CTA GGG ATC CGC AAT TGT CTT GGC CAT TG 
was synthesized for the 3' end of the cassette. The primers were 
used to amplify the cassette by PCR from BCG strain Pasteur 
chromosomal DNA. The addition of the Bam HI site at the 3' end 
of the cassette adds one codon (Asp) to the first six codons of 
the HSP60 gene. 

Each of PMV206 and the PCR cassette HSP61 was digested with 
Xbal and BamHI. The PCR cassette was then inserted between the 
Xbal and BamHI sites of pMV206, then ligated to form plasraid 
pIW261. The construction of this plasmid is shown schematically 
in Figure 23. 

The 780 bp OspA PCR cassette as hereinabove described, was 
digested with BamHI and Sail, and ligated to BamHI and Sail 
digested pMV261 to form p2 61:: OspA. 
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Example 5 

A DNA cassette encoding the promoters and transcription 
start sites, as identified in Stover, et al. (1991), ribosome 
binding site/ and start codon of the BCG HSP50 gene, was 
constructed by PGR- Such a cassette is the same as that of the 
BCG HSP61 cassette hereinabove described except that this 
cassette does not include the 15 bases (5 codons) 3' to the start 
codon. This cassette, which is 267 bp in length, and shown in 
Figure 24, includes added Xbal and Ncol sites, with a start codon 
included in the Ncol site. The cassette, after construction, was 
digested with Xbal and Ncol. 

This cassette was placed into Xbal and Ncol digested pMV206 
to form pMV251. (Figure 25). A full length OspA gene (including 
the signal sequence and as published in Bergstrom, et al. (1989)) 
was then derived by PGR as an Ncol -Sal I restriction fragment. 
This fragment was then digested with Ncol and Sail, and ligated 
to Ncol and Sail digested pMV251 to form p251::OspA. 

Example 6 

pRB26 was constructed as described in Example 1. The 38 kda 
antigen gene ribosomal binding site, start codon, and secretion 
signal sequence was obtained from M, tuberculosis chromosomal DNA 
and amplified by PGR with nucleotide primers. The resulting 
fragment also includes added Bglll-BamHI zEcoRI sites. The PGR 
fragment, 210 bp in length (Figure 26), is digested with Bglll 
and EcoRI and ligated into BamHI and EcoRI digested pRB26 to form 
p2638S (Figure 27). p2638S is then digested with BamHI and Sail. 
The 780 bp OspA PGR fragment described in Example 1 is digested 
with BamHI and Sail and ligated to the BamHI and Sail digested 
p2638S to form p2638::OspA. 



Example 7 



wo 93/07897 



26 



PCr/US92/09075 



This example describes the formation of p3638::OspA, which 
includes sequences encoding bacteriophage integration into a 
ntycobacterium chromosome. DNA encoding the secretion signal of 
the 38 kda m -i-„h^^culosis antigen, as well as the OspA gene. 

PMV206 was constructed as hereinabove described in Example 

^ Plasmid PMH9.4, which includes the mycobacteriophage L5 attP 
site and the L5 integrase gene, was employed in providing the L5 
integration sequences to a BCG expression vector. The 
construction of pMH9.4. as well as its integration into 
smegmatis and BCG, is described below in sections (i) through 
(vi ) - 

(i) TH>>ni-,Tficati o" the DNA sequences o f the attachment sites, 
attB , attL. and attR, r ^f M. smegmatis > 

Using standard technologies, a lambda EMBL3^ library was 
constructed using chromosomal DNA prepared from mc 61 (a strain 
of M, smegmatis which includes an M. smegmatis chromosome into 
which has been integrated the genome of mycobacterial phage L5) 
and digested with Bam HI. Phage L5 contains DNA having 
restriction sites identical to those of phage LI (Snapper, et al. 
1988), except that L5 is able to replicate at 42 and phage LI 
is incapable of such growth- This library was then probed with a 
6.7 kb DNA fragment isolated from the L5 genome that had been 
previously identified as carrying the attP sequence (Snapper, et 
al 1988). One of the positive clones was plaque purified, DNA 
prepared, and a 1.1 kb Sal I fragment (containing the AttL 
segvience) sub-cloned into sequencing vector pUC119. The DNA 
sequence of this fragment was determined using a shotgun approach 
coupled with Sanger sequencing. By isolating and sequencing the 
attL junction site and comparing this to the DNA sequence of L5 
that was available, a region was determined where the two 
sequences aligned but with a specific discontinuity present. The 
discontinuity represents one side of a core sequence, which is 
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identical in AttP^ attB, and attL. The region containing the 

recombinational crossover point is shown in Figure 28. 

The attL DNA (1-1 kb Sal I fragment) was used as a probe to 

2 

hybridize to a Southern blot of Bam HI digested mc 6 DNA, which 
is a strain of M. smeqmatis which includes an M. smegmatis 
chromosome without any phage integration (Jacobs, et al, 1987, 
hereinabove cited.)- A single band of approximately 6.4 kb was 

detected corresponding to the attB sequence of M, smeqmatis . 

2 

This same attL probe was used to screen a cosmid library of mc 6 
(provided by Dr. Bill Jacobs of the Albert Einstein College of 
Medicine of Yeshiva University), and a number of positive cosmid 
clones were identified. DNA was prepared from these clones, and 
a 1.9 kb Sal I fragment (containing the attB site) that 
hybridizes to the attL probe was subcloned into pUG119 for 
sequencing and further analysis. The DNA sequence containing the 
core sequence was determined and is shown in Figure 28, The core 
sequence, which is identical in attP, attB and attL, has a length 
of 43bp. 

2 

The mc 61 lambda EMBL3 library was then probed with the 
1.9kb Sail fragment containing the attB site. Positive plaques 
were identified, DNA was prepared, and analyzed by restriction 
analysis and Southern blots. Lambda clones were identified that 
contained a 3.2kb Bam HI fragment containing the putative attR 
site. The 3.2kb Bam HI fragment was purified and cloned into 
pUC119 for sequencing and further analysis. 

.( i i ) Determination of attP-inteqrase region of L5 genome . 

Concurrent with the above procedures, a significant 
portion of the DNA sequence of L5 had been determined and 
represented in several "contigs" or islands of DNA sequence. 
Sequences of the 6 . 7kb Bam HI fragment hereinabove described were 
determined by (a) analysis of the location of Bam HI sites in the 
contigs of the DNA of L5, and (b) by determining a short stretch 
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of DNA sequence from around the Bam HI sites of plasmid pJR-1 
(Figure 33), which carries the 6.7kb Bam HI fragment of L5. 

A segment of DNA sequence was located that represented the 
6.7kb Bam HI fragment of phage L5. Studies of other phage^ have 
shown that the integrase genes are often located . close to the 
attP site. It was thus determined that the L5 integrase (xnt) 
gene should lie either within the 6.7kb Bam HI fragment or xn a 
DNA sequence on either side of it. The DNA sequence in the 
regions was then analyzed by translating it into all six possible 
reading frames and searching these amino acid sequences for 
Similarity to the family of integrase related proteins and 
through computer-assisted analysis of the DNA sequence. As shown 
in Figure 29, there are shown two domains of reasonably good 
conservation among L5 integrase and other integrases, and three 
amino acid residues that are absolutely conserved in domain 2. 
(See Yagil, et al., -T, Mol. Biol. , Vol. 207, pgs. 695-717 (1989), 
and Poyart-Salmeron, et al. , J- EMBO., Vol. 8, pgs. 2425-2433 
(1989)). A region was identified, and analysis of the 
corresponding DNA sequence showed a reading frame that could 
encode for a protein of approximately 333 amino acids. These 
observations identified the putative int gene. 

The location of the int gene was not within the 6.7kb Bam HI 
fragment; however, it was very close to it with one of the Bam HI 
sites (that defines the 6.7kb Bam HI fragment) less than 100 bp 
upstream of the start of the gene. Analysis of the Bam HI sites 
showed that the int gene lay within a 1.9kb Bam HI fragment 
located adjacent to the 6.7kb Bam HI fragment. This 1.9kb Bam HI 
fragment was cloned by purification of the fragment from a Bam HI 
digest of L5 DNA and cloning into pUC 119, to generate pMHl 
(Figure 34) . 

From a combination of the above approaches, a schematic of 
the organization of the attP-int region of L5 was constructed 
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(Figure 30), and the gene sequence of the attP-int region is 
given in Figure 31. 

(iii) Construction of pMH5 , 

The 6.7kb Bam HI fragment of mycobacteriophage L5, which 
contains the attP site, as hereinabove described, was cloned into 
the Bam HI site of pUC 119 (Figure 32). This was achieved by 
purifying the 6.7kb Bam HI fragment from a Bam HI digest of L5 
DNA separated by agarose gel electrophoresis and ligating with 
Bam HI cut pUC 119. DNA was prepared from candidate recombinants 
and characterized by restriction enzyme analysis and gel 
electrophoresis. A recombinant was identified that contained the 
6.7kb Bam HI fragment of L5 cloned into pUC 119. This plasmid 
was named pJR-1, as shown in Figure 33. 

Analysis of DNA sequence data from a project to sequence L5 
showed that a 1 . 9kb Bam HI fragment adjacent to the 6.7kb Bam HI 
fragment hereinabove described contained the integrase gene. 

A plasmid containing a 1.9kb Bam HI fragment containing the 
DNA encoding for the integrase cloned into the Bam HI site of pUC 
119 was constructed. The 1.9kb fragment was purified from a Bam 
HI digest of L5 DNA and cloned into the Bam HI site of pUC 119. 
Construction of the recombinant was determined by restriction 
analysis and gel electrophoresis. This plasmid was called pMHl, 
the construction of which is shown schematically in Figure 34. 

pJR-1 was then modified by digestion with BcoRI and SnaBI 
(both are unique cloning sites), between which is a Bam HI site. 
The Eco Rl-Sna BI fragment, including the Bam HI site was 
excised, and the plasmid was religated to form plasmid of pMH2, 
which contains on Bam HI site compared to two Bam HI sites 
contained in pJR-1. A schematic of the construction of pMH2 is 
shown in Figure 35. 

The 1.9kb Bam HI fragment, which includes the integrase 
gene, was purified from a Bam HI digest of pMHl and ligated to 
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Bam HI digested pMH2. Recombinants were identified as above and 
the orientation of the 1.9kb fragment determined. A plasmid 
called pMH4 was thus constructed (Figure 36) in which the region 
from the Sna BI site (upstream of attP) through to the Bam HI 
site (downstream of the integrase gene) was identical to that in 
L5. 

pMH4 was digested with Hindi II (unique site) and was ligated 
to a Ikb Hindlll fragment purified from pKD43 (supplied by Keith 
Darbyshire of the Nigel Gindley Laboratory) that contains the 
gene determining resistance to kanamycin. Recombinants were 
identified and characterized as above. This plasmid is called 
pMH5. A schematic of the construction of pMH5 is shown in Figure 
37- 

f iv^ Integration of pWHS into a ttB of M. smegmatis. 

Plasmids pyUBX2 (a gift from Dr. Bill Jacobs, a schematic of 
the formation of which is shown in Figure 1), pMDOl (Figure 38), 
and pHH5 were electroporated, with four different concentrations 
of plasmid DNA over a 1,000-fold range, into M. smegmatis strain 
mc^l55, a strain which is able to support plasmid replication, 
in sections (iv) through (vi), all electroporation procedures of 
M. sm eg matis , or of BCG, were carried out as follows: 

Cultures of organism were grown in Middlebrook 7H9 media, as 
described by Snapper, et al. (1988), harvested by centrifugation, 
washed three times with cold 10% glycerol, and resuspended at 
approximately a 100 x concentration of cells. 

1 jxl of DNA was added to 100 nl of cells in an ice-cold 
cuvette and pulsed in a Bio-Rad Gene Pulser, and given a single 
pulse at 1.25 kv at 25 nF. 1 ml of broth was added the cells 
incubated for 1 hr. at 37'»C for expression of the 
antibiotic-resistant marker. Cells were then concentrated and 
plated out on Middlebrook or tryptic soy media containing 15 
|ig/ml kanamycin. Colonies were observed after 3 to 5 days 
incubation at 37**C. 



wo 93/07897 



31 



PCT/US92/09075 



Each of pYUB12, pMDOl, and pMH5 carries kanamycin 
resistance. Plasmid pYUB12 carries an origin of DNA replication, 
while pMDOl lacks a mycobacterial origin of replication. Plasmid 
pMH5 does not* carry a mycobacterial origin of replication, but 
carries a 2kb region of phage L5 which contains the attP site and 
the integrase gene (Figure 31). The number of transf ormants were 

linear with DNA concentration, Plasmid pyUB12 gives a large 

5 2 
number of transf ormants (2 x 10 per \xg DNA) in mc 155, while 

4 

pMH5 gives 6 x 10 transf ormants per |ig DNA, and pMDOl gives no 
transf ormants . 

The above experiment was then repeated by electroporating 

the plasmids pYUB12, pMDOl, and pMH5 into M. smegmatis strain 
2 

mc 6, which does not support plasmid replication. No 

2 

transf ormants in mc 6 were obtained from pYUB12 or pMDOl, while 

4 

pMH5 gave approximately 10 kanamycin resistant transf ormants in 
2 

mc 6 per ^g of DNA, thus indicating integration of pMH5 into the 
mc^6 chromosome. 

2 

DNA from six independent pMHS transf ormants (four in mc 155 
2 

and two in mc 6) was prepared. These DNA's (along with DNA from 

2 2 
both mc 155 itself, and mc 155 carrying the plasmid pyUBI2) were 

digested with a restriction enzyme, and analyzed by Southern blot 

and hybridization with the M. smeqmatis 1.9kb attB probe 

hereinabove described. As shown in Figure 39, all six 

transformants have integrated into the attB site, resulting in 

the production of two new DNA fragments with different 

mobilities. If pMH5 did not integrate into the attB site, it 

would be expected that a single band, corresponding to the attB 
2 

site in the mc 155 control, would be obtained. 
{ V ) Construction of pMH9.2 and pMH9.4 

pUC119 was digested with Hindi II, and a Ikb Hindi I I 
fragment, containing a kanamycin resistance gene, purified from 
pKD43, was ligated to the Hindlll digested pUC119 to form pMH8 
(Figure 40). A 2kb Sail fragment (bp 3226-5310), which carries 
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the attP and integrase gene from Sail digested pMH5, was purified 
and inserted in both orientations relative to the vector backbone 
of sail digested pMH8 to form plasmids pMH9.2 and pMH9.4 (Figures 

41 and 42). " ' ^ 

M. smeqmatis strain mc^lSS cells carrying, as a result of 
el-ctroporation, plasmid pYUB12, pMH9.2 or pMH9.4, or strain mc 6 
cells carrying plasmid pMH5. as a result of electroporation as 
hereinabove described, were grown to saturation in broth wxth 
kanamycin. Cultures were then diluted 1.100 into broth without 
kanamycin and grown to saturation. Two further cycles of 
dilution and growth were done, corresponding to about 20 
generations of bacterial growth. Cultures were plated out to 
single colonies on non-selective plates, and approximately 100 of 
these colonies were patch plated onto both non- selective and 
selective plates. The % of colonies that were sensitive to 
kanamycin, thus corresponding to the percentage of cells which 
lost the plasmid, is given below in Table I, 

Table I 
% loss 

PYUB12 (mc^l55) 35 

pMH5 (mc^6) 17 

pMH9.2 (mc^l55) 3 

pMH9.4 (mc^l55) 0 
■Cy?-)- Tra nsformation of BC G with r>MH9.4 

The 1.9 kb Sal I fragment, which includes the M, smeqmatis 
attB site as hereinabove described was cloned into pUCll9, and 
the plasmid generated was named pMH-12. (Figure 43). 

Gel purified Sal I 1.9kb M. ameamatis fragment containing 
attB (isolated from pMH-12) was used to probe a Southern transfer 
of Bam HI digested mycobacterial DNA's, including BCG substrain 
Pasteur, shown in Figure 44. This demonstrated that there is one 
Bam HI fragment of BCG that strongly hybridizes to the VL 
smeqmatis attB probe and three hybridize weakly. The strongest 
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hybridizing band is the fastest moving band (approximately 1,9 
kb). 

The same probe as above was used to probe a BCG cosmid 
library (provided by Dr. Bill Jacobs) and positive clones were 
identified. DNA was prepared from several positive clones and 
analyzed by restriction analysis and Southern blotting. The 1.9 
kb Bam HI fragment (corresponding to the strongly hybridizing 
band in the Southern blot was identified, gel purified from the 
cosmid DNA and cloned into pUC119. The resulting plasmid was 
named pMH-15. (Figure 45). 

Plasmid pMH-5 and pMH9 . 4 were electroporated into BCG 

Pasteur. It was observed that pMH9.4 transforms BCG with high 

4 

efficiency (approximately 10 transf ormants/fig DNA), while pMH-5 

transforms BCG at low efficiency (1-10 transf ormants/|ig DNA). 

DNA was prepared from BCG transf ormants and analyzed by Bam HI 

restriction and Southern blot analysis, probing with gel purified 

1.9kb Bam HI BCG attB fragment from pMH-15. These data are shown 

in Figure 41 and show that integration of both pMH5 and pMH9.4 is 

specific to the BCG attB site (ie. the strongly cross-hybridizing 

fragment in BCG). This is Illustrated by the loss of the 1.9kb 

Bam HI fragment from the transf ormants and the appearance of two 

new bands representing attL and attR junction fragments. Figure 

46 shows just one of the pMH5/BCG transf ormants, although all of 

the four that were analyzed show that one of the bands (the 

largest) is smaller than expected (and different in each of the 

transformants) , indicating that the transformation effiency of 

pMH-5 is low in BCG. In contrast, the four pMH9.4 transf ormants 

are identical to each other (Figure 46) and give attR and attL 

junction fragments of the predicted sizes. 

Plasmid pMV206 was digested with Not I to remove the 

mycobacterial replicon. The resulting 2209 bp fragment, which 

R 

includes the aph (Kan ) gene, the E. coli replicon and the 
multiple cloning site, was ligated and recircularized to form 
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PMV205, the construction of which is schematically depicted in 
Figure 11. 

PGR with primers Xbal-Att/Int and Nhel-Att/Int was then 
performed on a Sal I fragment from pMH9.4, which contains the 
attP site and the L5 integrase gene. The resulting cassette was 
then digested with Xhal and Nhel and a 1789 bp fragment was gel 
purified. PM7205 was then digested with Nhel, and the resulting 
fragment was ligated to the 1989 bp fragment obtained from pMH9.4 
to form PMV306. A schematic of the construction of pMV306 is 

shown in Figure 47. 

p2638::OspA (from Example 6) and pMV306 were each digested 
with Xbal and Sail. The Xbal-Sall fragment of p2638:OspA, which 
contains the HSP60 promoter, 38 kda secretion signal sequence, 
and OspA antigen sequence, was ligated into Xbal and Sail 
digested pMV30e to form p3 638 :t OspA. 

Example 8 

pRB26 was constructed as described in Example 1. The 32 kda 
-antigen gene of M. tuberculosis or BCG (Matsuo, et al. , 
Bacteripl, Vol. 170, No. 9, pgs 3847-3854 (Sept. 1988); 
Borremans, et al.. Infect, an d Immun. , Vol. 57, No. 10, pgs. 
3123-3130 (Oct. 1989)) was obtained from BCG chromosomal DNA and 
amplified by PGR using primers including added BgllI-BamHI:EcoRI 
sites. The PGR fragment, 420 bp in length (Figure 48), was 
digested with Bglll and EcoRI, and ligated into BamHI and EcoRI 
digested pRB26 to form pAB261 (Figure 49), which contains the 
enti re tfC -antigen gene. pAB261 was then digested with BamHI and 
Sail, and the 7aObp PGR OspA cassette hereinabove described in 
Example 1, was also digested with BamHI and Sail, and was ligated 
to BamHI and Sail digested pAB261 to form pAB261: :OspA. 



Example 9 
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Plasmid pMV206 was constructed as hereinabove described in 
Example 1. 

A partial sequence of the 5' region of the ©CG HSP70 gene 
(which encodes the BCG HSP70 heat shock protein, also known as 
the 70 kda antigen) obtained by Dr. Raju Lat^igra (Medical 
Research Council, London) permitted the construction of a 
cassette carrying the promoter sequence. The HSP70 promoter was 
amplified by PGR with primers including Xba and Nhel sites. The 
HSP70 promoter PGR fragment, 121 bp in length (Figure 50), was 
dige^sted with Xbal and Nhel, and ligated to Xbal digested pMV206 
to form pRB27. (Figure 51.) The 32 kda oC -antigen gene of BGG 
was obtained from BCG chromosomal DNA as described in Example 8, 
and amplified by PGR using primers including added 
Bgll I-BamHI :EcoRI sites. The PGR fragment was -digested with 
Bglll and EcoRI, and ligated into BamHI and EcoRI digested pRB27 
to form pAB271 (Figure 52), which contains the entire t;^: -antigen 
gene. pAB271 was then digested with BamHI and Sail, and the 
780bp PGR QspA cassette hereinabove described in Example 1, was 
also digested with BamHI and Sall^ and was ligated to BamHI and 
Sail digested pAB271 to form pAB271 : : OspA* 

Example 10 

Vectors pl9PS: :OspA, p38PS: :OspA, pMV261: :OspA, and 
pMV251::0spA were transformed into BGG. The transformed BGG 
cells were cultured, and the cells were then sedimented from the 
cultures. The cells were then suspended in phosphate buffered 
saline (PBS), and cell suspensions were normalized to equivalent 
densities. The cells were disrupted by sonication, the cell 
envelopes were sedimented, and the supernatant (a 
Gyto.«3ol-enriched fraction) was saved. The cell envelopes were 
resuspended in PBS, and membranes were solubilized at 4°G by the 
addition of Triton X-114 to 2% (vol./vol.). Insoluble material 
(a cell wall-enriched fraction) was sedimented, and the 
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supernatant (membrane- enriched fraction) was removed- Triton 
X-114 was added to the Cyto sol -enriched fraction. After brief 
warming of the Triton X-114 solutions at 37°C, separation of 
aqueous and" detergent phases was achieved by a short 
centrifugation- These two phases were back-extracted three 
times, and proteins in representative samples were precipitated 
by the addition of acetone. A portion of each culture 
supernatant was concentrated by an ultrafiltration device 
(Centricon-30, Amicon) . Samples representing culture volume 
eqt^ivalents were processed by SDS-PAGE, transferred to 
nitrocellulose, and Western blotted with anti-OspA monoclonal 
antibody (Mab) H5332. (Howe, et ai. , Infect, and Immun. , Vol. 
54, No. 1, pgs. 207-212 (Oct. 1986)), Ellter-bound antibody was 
visualized with an enhanced chemi luminescence . system (Amersham) . 
As shown in Figure 53, Lane 1 is a molecular weight standard 
(Rainbow Markers, Amersham); lane 2 is a whole cell sonicate 
fraction; lane 3 is Triton X-114 insoluble material; lane 4 is 
the aqueous phase membrane fraction; lane 5 is the detergent 
phase membrane fraction; lane 6 is the aqueous phase Cytosol 
fraction; lane 7 is the detergent phase Cytosol fraction; and 
lane 8 is a concentrated culture medium. 

As can be seen from Figure 53, recombinant chimeric OspA 
fusion proteins expressed from the expression vectors pl9PS::0spA 
and p38PS::0spA were found to be localized predominantly in the 
Triton X-114 phase from the membrane fractions, thus suggesting 
that these recombinant OspA proteins were fused to the 
mycobacterial 19 kda and 38 kda secretion signals, which directed 
secretion and post-translational processing by fatty acylation at 
an N-terminal cysteine. OspA expressed with its native 
lipoprotein signal peptide by pMV251ttOspA was found to be 
localized in detergent soluble BCG membrane fractions although 
additional OspA was also found in BCG cytoplasmic aqueous 
fractions, thus suggesting that the OspA signal was not as 
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eff icien1:ly processed in BCG as were the 19 kda and 38 kda signal 
sequences. Recombinant OspA expressed by pMV261: rOspA, wherein 
OspA was not fused to a lipoprotein signal, was found to be 
localized only in aqueous cytoplasmic fractions. 

Example 11 

BCG cells were transformed with either pAB261: j OspA, 
pAB271: :OspA, or pMV261::OspA and cultured. Portions of BCG 
culture supernatants were depleted of bovine serum albumin (BSA), 
a component of the medium, by adsorption with Affi-gel Blue (Bio 
Rad) . BCG cell pellets from the cultures were suspensed in PBS 
and sonicated. Adsorbed or unadsorbed supernatants were 
concentrated (Centricon 30) and then diluted to the same relative 
concentration, on a culture volume basis, as the lysed cells. 
Samples were used for SDS-PAGE and subsequent immunob lotting with 
anti-OspA (Mab H 5332), anti-Hsp70 (Mab IT-41, WHO mycobacterial 
monoclonal antibody bank), or anti-Hsp60 (Mab IT- 13, WHO 
mycobacterial monoclonal antibody bank) . As shown in Figure 54, 
lane M.W. Std. is a molecular weight standard, lanes W are whole 
cell lysates, lanes S are culture supernatants (unadsorbed), and 
lanes A are adsorbed supernatants. As shown in Figure 54, it was 
determined that fusion of the OspA gene, without the secretion 
signal, to the complete o- antigen gene resulted in high level 
expression, and a substantial fraction of the resulting 
recombinant a-antigen-OspA fusion protein was found to be 
excreted into the culture media. The absence of detectable 
quantities of cytoplasmic proteins (Hsp60 and Hsp70) in the 
supernatant Indicated that cell lysis was minimal, and that the 
recombinant «-antigen: rOspA fusion protein was specifically 
targeted to be secreted and is not simply found in culture 
supernatants due to autolysis. 
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Example 12 

BCG organisms (Pasteur strain) were transformed with one of 
the following vectors s 
pMV261 : : OspA 
pMV251: :OspA 
p2619;:OspA 
pl9PSr:OspA 
p38PSr J OspA 
p3638t tOspA 
pAB261::OspA 

AS a negative control, pMV261/LZ was used to transform a 
control group of organisms. pMV261/LZ was constructed by cloning 
a BamHI restriction fragment carrying the E^coli lacZ gene (which 
encodes B-galactosidase) into the BamHI site of Bam HI digested 
pMV261. 

The transformed BCG colonies were isolated by selection for 
kamamycin resistance and expanded in liquid media culture for 
further analysis. Recombinant BCG samples representing culture 
volume equivalents were processed by SDS-PAGE, transferred to 
nitrocellulose, and Western blotted with anti-OspA Mab H5332. 
Positive controls employed were a processed sample of 
B.burgdo.rferi strain B31, and samples of OspA antigens in 
concentrations of 500 ug/ml, 100 ng/ml. and 20 ng/ml. The 
filter-bound antibody was visualized with an enhanced 
chemilumineseuce system (Amersham) . As shown in Figure 55, OspA 
was expressed by BCG transformed with vectors including the OspA 
gene. Figure 55 also shows the expression of a fusion protein of 
OspA and a mycobacterial secretion signal by BCG transofmred with 
p2619::OspA, pl9PSi tOspA, p38PS::OspA, or p3 638: : OspA. 

Example 13 

BCG organisms were transformed with pHV261: :OspA, and the 
transformed organisms were cultured. Twenty-four different 
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strains of mice, with five mice representing each strain, were 
immunized with a single dose of 1 x 10^ CFU of BCG transformed 
with pIW261::OspA (post freeze titer of 42%) intraperitoneally . 
The mice were bled every four weeks for 16 weeks, and also at 19 
weeks. Sera were analyzed by ELISA on whole cells of Borrelia 
burg do rferi and BCG lysate coated on wells. The reaction was 
developed with peroxidase conjugated anti-mouse immunoglobulin 
and substrate. Color development was read as absorbance at 
405nm, Positive sera had optical density (O.D. ) values at three 
standard deviations above the mean of the prebleed sera. At 17 
weeks, the mice were given a booster intraperitoneal injection of 
1 X 10^ CFU of BCG transformed with pMV261 : rOspA. 

As shown in Figures 55 and 57, the following strains: 

A/HeJ 

A/J 

AKR/J 

BALB/cByJ 

CBA/J 

C3H/HeJ 

SJL/J 

LP/J 

129/J 

CE/J 

BlO.BR/SgSnJ 

D4 Swiss Webster 

Senear 

FVB 

showed an immune response after a single immunization, and the 
following strains: 

A/HeJ 

A/J 

C3H/HeJ 
129/J 
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CE/J 

BlO.BR/SgSnJ 

D4 Swiss Webster 

had responded- With significantly high levels. of antibody against 
Rn-rirg^ll a burgdorferi . 

Rxamole 14 

BCG organisms transformed with either pMV261 . :OspA, 
pAB261::OspA, pl9PS.:OspA, or pMV251 : = OspA, plus non- recombinant 
BCG Pasteur organisms were subjected to cell fractionation and 
Triton X-114 detergent phase partitioning analysis CBordier, et 
al. -T . Biol. Chem. , Vol. 256, pg, 1604 (1981); Radolf, et al., 
infect _and_^inrnun^. Vol. 56, pg. 490 (1988)) to determine if 
expression of OspA genes in the vectors hereinabove described 
resulted in export and lipid acylation of recombinant OspA 
protein. 

Recombinant BCG cells wee sedimented from BCG cultures, 
suspended in phosphate buffered saline (PBS), and cell 
suspensions were adjusted to equivalent densities. Cells ^ were 
disrvipted by sonication and mebranes were solubilized at 4°C by 
the addition of Triton X-114 to 2% {vol./vol.). Insoluble 
material (cell wall enriched fraction) was centrifuged, and the 
supernatant was subjected to detergent phase partitioning. After 
briefly warming (37°C) the Triton X-114 solutions, separation of 
aqueous and detergent phases was achieved by a short 
centrifuguation . The two phases were back- extracted three 
times, and proteins in representative samples were precipitated 
by the addition of acetone. A portion of each culture 
supernatant was concentrated by ultrafiltration. Samples 
representing 5- fold concentrated culture volume equivalents were 
processed by SDS-PAGE, transferred to nitrocellulose and blotted 
with anti-OspA MAb H5332. (Figure 58). Similar fractions from 
non-recombinant BCG were blotted with appropriate monoclonal 
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antibodies specific for the BCG or M> tuberculosi s Hsp60 protein 
(IT13), a antigen (HYT27), or M. tuberculosis 19kda antigen (HYT6) 
to determine the cellular location of the native fusion partners. 
As shown in Figure 58, lane W is a whole cell sonicate fraction; 
lane I is a Triton X-114 insoluble cell wall enriched fraction; 
lane A is a cytosol-enriched aqueous fraction; lane D is a 
detergent phase (membrane-enriched) fraction and lane M is a 
5-fold concentrated culture medium fraction. 

As shown in Figure 58, the OspA gene product encoded by 
pMV261::OspA was found excessively in the aqueous cytosolic 
fraction (lane A) and correlated with the exclusive cytoplasmic 
location of HSP60. The a -anti gen- OspA gene product expressed by 
pAB261::OspA and the native BCG a-antigen were found in the 
insoluble cell wall enriched fraction (lane I), aqueous cytosolic 
fraction (lane A), and media fraction (lane M) , but not in the 
detergent soluble lipoprotein-enriched fraction (lane D). The 
presence of the a-antigen in the recombinant BCG culture media 
was not due to recombinant BCG autolysis, as HSP60 was not found 
in the culture media. Compared to the native BCG a-antigen, a 
substantially smaller fraction of the fusion protein expressed by 
pAB261::OspA was secreted into the media, while a larger portion 
was found in the cell wall enriched insoluble fraction. This 
suggests that fusion to the a-antigen could also direct foreign 
antigens to the cell wall. Substitution of the M. tuberculosis 
19kda antigen signal peptide for the OspA signal peptide resulted 
in expression of a chimeric OspA protein that was located almost 
exclusively in the detergent soluble fraction. This finding 
indicated that fusion of the M. tuberculosis 19kda antigen signal 
peptide to OspA did direct efficient expression and export of the 
OspA protein to the membrane of BCG, This result was in contrast 
to the product expressed by organisms transformed with 
pMV251 : :OspA, where most of OspA was found in the aqueous 
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frar-^ion, which may have been due to inefficient processing of 
the native Borrelia signal peptide. 

Example 15 

The recombinant BCG organisms of Example 14 were analyzed by 
flow cytometry to determine if the recombinant OspA gene products 
were accessible on the surface of recombinant BCG to anti-OspA 
antibody. 

Approximately 2 x 10^ recombinant BCG organisms grown in 
Dubos media supplemented with albumin-dextrose complex and 0.05% 
Tween 80 were harvested by centrifugation. The pelleted 
recombinant BCG organisms were washed with 10 ml. of phosphate 
buffered saline (pH 7.4) containing 0.05% Tween 80 (PBS-TSO) , 
resuspended in 5 ml. PBS-T80, and fixed for 10 minutes in 2% 
paraformaldehyde. Fixed recombinant BCG organisms were pelleted 
and washed twice with 5 ml. PBS-T80, and then resuspended in 1 
ml. of PBS-T80. Polyclonal rabbit sera specific for OspA 
(BCG-adsorbed) was added to the fixed recombinant BCG cell 
suspension to a final dilution of 1:200 and incubated for 30 
minutes at room temperature and 30 minutes on ice. The 
suspension was then pellted by centrifugation, washed twice with 
0.5 ml. PBS-T80 and resuspended in 1 ml. PBS-T80. Goat 
anti-rabbit FITC-conjugated secondary antibody was added to a 
final dilution of 1:50 and incubated for 30 minutes on ice. The 
recombinant BCG-secondary antibody suspension was pelleted by 
centrifugation, washed twice with 1 ml. PBS-T80 and resuspended 
in 2 ml. PBS-T80. Labeled recombinant BCG were mildly sonicated 
to disperse clumped cells and dilutions were analyzed by flow 
cytometry on an FACS scan (Becton-Dickinson) . Recombinant BCG 
containing the designated plasraids and expressing the designated 
chimeric OspA gene products are compared to non- recombinant BCG. 
(Figure 59). 
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As shown in Figure 59, recombinant BCG organisms expressing 
OspA from plasraids pl9PS:OspA, pMV251 : : OspA, and pAB261 : : OspA, 
all demonstrated increased surface fluorescence with anti-OspA 
sera when compared with non- recombinant BCG or recombinant BCG 
expressing OspA from plasmid pMV261 : : OspA. The relative surface 
fluorescence exhibited by expression of OspA from organisms 
transformed with pMV251::0spA was less than that observed for 
organisms transformed with pl9PS::0spA, and was in agreement with 
the fractionation analysis of Example 14. The recombinant BCG 
expressing OspA from pAB261::OspA also exhibited surface 
f lourescence, thus confirming that the a -anti gen- OspA fusion 
protein found in the Triton insoluble fraction (Example 14) was 
cell wall associated and not derived from insoluble inclusion 
bodies. Therefore, it was possible to export OspA to the surface 
of BCG as a membrane-associated lipoprotein by fusion to the M. 
tuberc ulosis 19kda antigen signal sequence, or as a secreted and 
cell wall associated protein by fusion to the a-antigen. 

Example 16 

C3H/He, BALB/C, and Swiss Webster mice were immunized with 
10 colony forming units of BCG organisms transformed with 
pMV261 : :OspA, pMV251 : : OspA, pl9PS::0spA, pAB261 : : OspA , or of 
non- recombinant BCG Pasteur. The mice were given a booster of 
the identical dose at 16 weeks. As shown in Figure 60, all three 
mouse strains immunized with BCG transformed with pMV251::OspA or 
pl9PS: :OspA exhibited strong OspA- specif ic antibody responses 
within 4 to 8 weeks after a single immunization as measured by 
ELISA to whole Borrelia organisms or purified OspA. Particularly 
striking were the anti-OspA responses elicited by a single dose 
of BCG organisms transformed with either pMV251::OspA or 
pl9PS::0spA; in the low responder Swiss Webster strain; the same 
strain of mice immunized with BCG transformed with pMV261::OspA 
or pAB261::OspA did not mount anti-OspA responses even after 
boosting. Peak anti-OspA antibody titers exceeding 1:10^ in 
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BKLB/C and C3H/He mice, and 1:10^ in Swiss Webster mice were 
elicited by boosting with BCG transformed with pMV251::OspA or 
pl9PS:.OspA, and these responses were 100 to 1,000-fold higher 
than the responses induced with BCG transformed with pMV261::OspA 
or pAB261: tOspA. 

Example 17 

Immune sera from the immunized C3H/He and BALB/C mice of 
Example 16 were analyzed for their ability to inhibit growth of 
the non-pathogenic B31 laboratory strain of B. burdorferi in 
culture in two independent experiments. (Sadziene, et al., 
Tnfe ct- Diseases , in press (1992). Growth inhibition titers for 
each of the immune sera are given in Table I below: 

Table I 



Mouse 


Vector 




Titer 




Strain 




Exoeriment 1 




ExTseriment 2 


BALB/C 


pMV261j lOspA 


<8 




N/A 


BALB/C 


pM7251: lOspA 


4096 




8924 


BALB/C 


pl9PS: :OspA 


1024 




16384 


BALB/C 


pAB261i lOspA 


N/A 




N/A 


BALB/C 


none (Control) 


<8 




<S 


C3H/He 


pMV261: tOspA 


32 




N/A 


C3H/He 


pMV25Tt rOspA 


1024 




32768 


C3H/He 


pl9PS: :OspA 


2048 




16384 


C3H/He 


pAB261i :OspA 


256 




N/A 


C3H/He 


none (Control) 


<8 




<8 


The 


above results show 


that antisera 


obtained from mice 


immunized 


with BCG transformed with pMV25l5 


tOspA or 


pl9PS: :OspA 



mice immunized with BCG transformed with pMV261s:0spA showed 
lower or undetectable growth inhibition titers. 

C3H/He and BALB/C mice immunized with the BCG organisms 
hereinabove described were then challenged with either 
lO^B. burgdorferi strain Sh^ organisms intraperitoneally (IP) or 
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10 organisms intradermally (ID). The B . burgdorferi organisms 

6 

were administered 5 weeks after a booster immunization of 10 
transformed BCG organisms. The mice were sacrificed 14 days 
after the B , burgdorferi challenge, and plasma, and bladder tissue 
were cultured in BSKII media. (Schwan, et al., J. Clin. 
Microb iol , Vol. 20, pg. 155 (1984)). Cultures were monitored 
through day 14 by phase contrast microscopy for the presence of 
spirochetes. The presence of one or more spirochetes per 20 high 
power fields in any one of the plasma or tissue cultures was 
scored as an infection. The fraction of the challenged mice 
exhibiting positive infections in the IP, and ID challenges are 



given in 


Table II below. 


Table II 




Mouse 


Vector 


No, of Infections 




Sjtjrain 




IP 


ID 


BALB/C 


pMV261: :OspA 


5/5 


N/A 


BALB/C 


pMV251: :OspA 


0/5 


0/5 


BALB/C 


pl9PS: sOspA 


0/5 


0/5 


BALB/C 


pAB261 : :OspA 


4/5 


N/A 


BALB/C 


none ( Cont ro 1 ) 


4/4 


4/4 


C3H/He 


pMV261: :OspA 


3/4 


N/A 


C3H/He 


pMV251 : :OspA 


0/5 


0/5 


C3H/He 


pl9PS: :OspA 


3/5 


0/5 


C3H/He 


pAB261 : :OspA 


3/5 


N/A 


C3H/He 


none ( Contro 1 ) 


5/5 


5/5 


The 


above results show 


that all control mice 


were 



be infected, whereas the mice that were immunized with BCG 
transformed with pMV251::OspA or pl9PS::0spA were protected from 
infection. 

It is to be understood however, that the scope of the 
present invention is not to be limited to the specific 
embodiments described above. The invention may be practiced 
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other than as particularly described and still be within the 
scope of the accompanying claims. 
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WHAT IS CLAIMED IS! 

1. An expression vector for expressing a protein or 
polypeptide- or peptide in a bacterium, comprising: 

a first DNA sequence encoding at least a secretion signal of 
a lipoprotein and a second DNA sequence encoding a protein or 
fragment thereof, or polypeptide or peptide heterologous to the 
bacterium which expresses the protein or fragment thereof, or 
polypeptide or peptide, whereby said bacterium expresses a fusion 
protein of a lipoprotein or lipoprotein segment and said protein 
or fragment thereof, or polypeptide or peptide heterologous to 
the bacterium which expresses the protein or polypeptide or 
peptide, 

2. The expression vector of Claim 1 wherein the bacterium 
is a mycobacterium. 

3. The expression vector of Claim 1 wherein said first DNA 
sequence encodes at least a secretion signal of a mycobacterial 
lipoprotein, 

4. The expression vector of Claim 3 wherein said 
mycobacterial lipoprotein is an M. tuberculosis lipoprotein. 

5. The expression vector of Claim 4 wherein said M. 
tubercul osis lipoprotein is selected from the group consisting of 
the 19 kda and 38 kda antigens. 

6. The expression vector of Claim 2 wherein said vector 
further comprises a mycobacterial origin of replication . 

7. The expression vector of Claim 2 wherein said vector 
further comprises a DNA sequence encoding mycobacteriophage 
integration into a mycobacterium chromosome. 

8. The vector of Claim 1 wherein said protein or fragment 
thereof, or polypeptide or peptide heterologous to the bacterium 
which expresses the protein or fragment thereof, or polypeptide 
or peptide is the PspA antigen of Streptococcus pneumoniae or a 
fragment or derivative thereof. 

9. A mycobacterium transformed with the vector of Claim 2. 



wo 93/07897 



48 



PCr/US92/09075 



10. The transformed mycobacterium of Claim 9 wherein the 

mycobacterium is BCG. 

11. A pharmaceutical composition comprising: 

the mycobacterium of Claim 9; and 

an acceptable pharmaceutical carrier. 

12. The expression vector of Claim 1 wherein said vector is 

a plasmid. 

13. The vector of Claim 12 wherein the vector is a shuttle 
plasmid, and further comprises a bacterial origin of replication. 

14. A method of protecting an animal against Lyme disease, 

comprising: 

administering to an animal mycobacteria transformed with DNA 
which includes at least one DNA sequence which encodes a protein 
or polypeptide which elicits antibodies against Borrelia 
burgdorferi, said mycobacteria being administered in an amount 
effective to protect an animal against Lyme disease. 

15. The method of Claim 14 wherein said at least one DNA 
sequence encodes a surface protein of Borrelia burgdorferi or a 
fragment or derivative thereof. 

16. The method of Claim 15 wherein said surface protein of 
Borrelia_burgdorferi is selected from the group consisting of 
(Xiter Surface Protein A and Outer Surface Protein B. 

17. The method of Claim 14 wherein said mycobacteria are of 

the species M. bovis -BCG. 

18. A composition for protecting an animal against Lyme 

disease, comprising: 

mycobacteria transformed with DNA which includes at least 
one DMA sequence which encodes a protein or polypeptide which 
elicits antibodies against Borrelia burgdorferi; and 

an acceptable pharmaceutical carrier, said mycobacteria 
being present in an amount effective to protect an animal against 
Lyme disease. 
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19. The .oai^Kjsition of Claim 18 wherein said at least one 
DNA sequence eiasrodes a surface protein of Borrella burcydorferi or 
a fragment or .derivative thereof. 

20- The c(DiB?>osition of Claim 19 wherein said surface 
protein of Borrelia burgdorferi is selected from the group 
consisting of Oaater Surface Protein A and Outer Surface Protein 
B. 

21. The exposition of Claim 18 wherein said mycobacteria 
are of the species bovis -BCG. 

22. An «qpression vector for expressing a protein or 
polypeptide in a bacterium which includes a DNA segvience encoding 
at least a seciDetion signal of a lipoprotein. 
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GAGCnxnCAAATianCGGCCAanGGCICACXXTroCT 

- + - * ♦ + +2900 

CTCXiACG^GTITAAGCAGOOGCIGCAOCGAGTGOGAAOCATCATClXXn^ 
LQE FED AVMSVS PLLG RN 
'TGAGCCGCCACCOCACAA(nXXL4CACTtXXX^ ^ 

. 4. ......... 4. ......... 4. ......... .f ......... 4. ......... 4. 3gQQ 

ACIXXXXGGItXXXntnriGACGTGnSAGGGGGCXjAGAGGGCAOC^^ 

LPWGVVACEGARGDLGS 
CAGGAGGAACACATCXXntXrriTOGAGGAOCTITroOGGGC^^ 

GltXTOrntTIXTrACGCAGCAAAGCnXltXLiAAGGC^^ 
V L L F V 

CIGAATCOXXXKnVUXUGODkCACAGCACXXXUACIT^^ 

. + + .4.. . .. -...-■♦•---»---- - 4- 32D0 

cx^wcTrAocxxxxx:ATtK,Ma;o'io'ii;ico'iGGGCTiGArfii^^ 

M.Rp«Mla 



GAG(7IGAGATAGGaxnACItACG(nTXX:AAGGGCGACACA(XXX^^ 

+ + + + - + - + 3300 

CItXLiCTCTATG(XX)GA1XSAG1GCGACm[TC^^ 

AOIAGCAGGIUrnXUGGCTIGGCaaUAGIGCAGG&CAIOC^^ 
■'^••■•■•••■^•■■•■•••» ■■■•♦aann 

mOTeOT^GMQCTOOQA^ 

TltXnvSOCAGOGOGroGAGCXXnTAGAGGOCCIGOGGIGI^^ 

f + + + + +3SD0 

CAACCAOCnO i COCAGCiXXXXJUaXnXXXXSCA^ 

CITGTCCAAGGCritTrATCTACGCTrAGTCCAAAC 

+ + + --- + +3«0 

CGAACAGG11XXX:ACATAGATCa«AATCAGGITIi:AAGITIGC^^ 

l€GGT(^TGACG(7rcAAAACCTCTBACACATGCAGCiarGG 

+ * + + - — + 37D0 

AGCCACTACTCKXACTITroCACLiCnJItTrACXriX^ 

(nCAGCOGGtUrWCCOGGIGtCaCOOCXSCAau^ 

......... 4.......... •4.. 4..«.......4.380O 

AGTTCGCOCACAACXXXXXIACACCCXXXXXntXXnACIXXXT^ 
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I,c,cL.M 

ACHGAGAGIGCAaiATATOCGGIGTCUAATAGCXX:^ 

+ ♦ ••••*• --+3900 

TCMCIXnXZACGTGGTATAaXXIACACTmTOGCXnGICI^^ 

OGCTCGGT0GTIVXXX:iGOBGGCaAGOGGIATCA(XnCA(nC^^ 

+ ^.........^ . + +4000 

OCGAGCOiGCXkGCOIGATrATimaiATCGICrAGCGAUm 



•*■ '* + +4100 

GTri ' ltX:GGMLVnTimAf ' ILVn^ GC ATIlTimX X^^ 

AAGTCAGAGG1XXXX:AAAOCCXUCAGGACrATAAAGATACX:AGGO^^ 
+ ......... + ...... ^.........4.42)0 

cxxaiTAa:iuiuxxx:ni ouxTiLxx x«AAGCxnxxxxx iTiu 
....^...•.....^•.••.••••^.••.••••.^.•••..•.•^..•.••••.^4300 

GOCTATCXMCAGGCGGAAAGAOGGAAGCCXTiaXZAGOC^^ 
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Fl O. © 



.Mu- , e>.n-y l46BdmH I- 
•Nhe I 6407 194 Nar I 

r423 Mac I- 



Aat II 4915 




1307 Bal I 



•AtwN I 4453 

•Nde i 3864 

•Spl I 3217 

Di<je«* withNarl and Ball, 
Fill in, Ligale/Recircularize 

MK . «oc)L l46BamH I 
-Nhe i 5285 |64 Nar I- 



845 Nco i- 
1071 EcoR I 




1967 Nca I- 

2193 EcoR 1- 
2431 Bgl II- 
2511 Sari- 



Digest withNdei and Spll, 
Fill in, Ligate /Recircuia r ize 



Mi. • KTc^ BamH I- 
Nhei 5763 ie4Narl- 

,423iy/lacl- 



I309 Bgl II 
1389 Sal 

Att 11 4271 



AlwN I 3608 
2096 Spll- 



•Nde I 2742 



Oigesi with BamHtaEcoRI 
and isolate 1071 bp fragment 




I307 Bal I* 



1967 Nco I- 

i2l93EcoR i 
2431 Bgl H 
2511301 1 



Aot 11 3149 

-AiwN I 2867 



p-Ligate -^Digest witli BamHI a EcoR I 

1 and isolate. 3570 bp fragment 

I /iR/u W6 BomH I- 
•Nhe I 4641 .194 Nor |. 



845 Nco i- 
1071 EcoR I- 

1309 Bgl il- 
1389 Sa I I- 
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F 1 o. y 



•Aat If 3149 



IMnel484l Nor,. 



AiwNl 2687 
•ApaL I 2590 




PCR mutagenesis 
ofEcoRI 1071 



845Ncof- 
l07l£eoR(- 

1309 Bgf |j- 
1389 Sal I- 



PCR mutagenesis 
of Sail 1369 



•Nhe I 4841 'IS^?.'""' 



184 Nar I- 




845NC0I 



1309 Bgl II 
l389Sall- 

•Aat II 3149 



f46BamH I- 
184 Nor I: 

•Nh« I 4841 



AiwN i 2687 
•ApaL I 2590 



Digest with ApaLI* Bgllf. 
\%o\KiH 3360 bp fragment 





-AlwN 1 2687 
-ApaL I 2590 



Digest with ApoLf-t- Bgl II. 
Isolote 1281 bp fragment 



Nhe I 4fl4l 146 BamH I" 



•Aat II 3149 



AlwN I 2687 
•ApaL I 2590 
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wo 93/07897 



16/63 



PCr/US92/09075 



<1Zq 











Z 




lO"" 








o>o 




roo) 




—to 




• - 


gi 


3 
CL 


cap? 








o 
o 



09 



SUBSTfTUTE SHEET 



WO93/078»7 l>Cr/US92/0W75 



0) 



0 



IX. 



UJ 

(0 

O 
2 

Z 
O 
-J 

LJ 
-J 
Q. 

b 



o 

I- 
u 

X 

I- 

>- 
<0 



< u 

< o 



u 
< 



< 

E3 
< 

< 
u 

< 
o 



< 
o 
u 

< 

8 



u 
u 
o 

o 



8 



u 
u 

^ 8 



< 

< 



u o 

< 

< 



OHO 

u 
< 

OOP 



3 t; 



u 
u 



< 
< 



o o 



.< 



o 
o 



< 



o o 

< < 

< u 



< H U 

s a ^ 

^ o ^ 

o o < 

U tr p 

? s fe 



q: 

LU 

o 

03 



JO rJ 

..is 



<3 



.i3 



•5 



J- 



-Si 
-c: 



i 




SUBSTITUTE SHEET 



wo 93/07897 



PCT/US92/09075 



u 
CD 

_0)_ 

• S2 = 



1219 Spe I 



XWOO«5- 



= O a. 

c o— O 

X CO OCO- 

*in-. mo 

iS^Q. > CM- 
Oo 

wiJi s - 

lO o. 
0> o- 



O CL 
o) a> 

.9-8 

2m 



z 




I go— Co 





ffl 



0 

0 

iL 



Bgl II 
27873 



SUBSTITUTE SHEET 



wo 93/07897 



I»Cr/US92/09075 



•Bel I 4101 
Sfi I 4084 
Spl I 4002 
•Hpo I 3993 
•Sal I 3984 
•Cla I 3976 
•EcoR I 3966 
•Pst I 3960 
•Pvull 3957 
•BomH I 3952 
•Nco I 3948 
•Xba I 3938 
•Bglll3928 CNhei 
:^PxnX5?22, 
•Not I 3914, 



207 Nru I- 




•Bel I 4115 
Spl I 4016 
•Hpo I 4007 
•Sal I 3998 
. -Cla I 3992 
-HinD III 3986 
•EcoR I 3979 
•Pst I 3974 
•Pvu II 3971 
•BomH I 3966 
•NcQ I 3962 
•Xba I 3953 
•Dra I 3947 
■Bgl II 3942 
•Kpn I 3936 iNhal 
.Notl3928_ 



■Bel I 2044 

•Hpo I 2096 Digest Mlul Bgl II 2797 
•Sal I 2087 Ligate Mlul -NotI Linker 
•Cla I 2081 Recircularize 
•HinD III 2075 
•EcoR I 2068 
•Pst I 2063 
•Pvu II 2060 
•BamHI2055 n!««* w«*i 

•NC0I205I ?!««f^2°*' . 

•Xbal2042 Ligafe/Recirculorize 

•Dra 12036 
Bgl II 2031 
•KpnI2025 
•Not 1 2017 
•Mlu 1 2011 



INhel 





2011 Mlu I- 
20l7Notl- 
2025 Mlu 1 



Fl O. II 



•Spe I 1219 



SUBSTITUTE SHEET 



no. i2cx 

c 
I 

1 + + ♦ + +-.. 

CGATOtXnTXnTCCCTGCAACAC^CAGTITrAGAGACTACA^ 

KA N START CONDON 
ACAGTAATACAAGGGGTItTrrAZXiAGCCATATICAACOCXX^^ 

IIIJL* + + + .--.+... 

TCTCATTATCITO^CAC^TA^^ 

201 ATKXXKHtxioCATiaTCltXXXXIAAl^ 

TAOOCGAGOtXnXTTACAGaXXnTAGriXXIACGCTGTOGAT^ 

GTIXXXIAATGAlXnTACAGATGAGATGGIXIAGACTAAACnYXX^ 
301 + + + + + 

CAAOGGTTACTACAATtnCTACIXnViaiAGTCTGATTK^ 

CATGGTTACIGACCACKXXIATOCCOGGGAAAACAGGATIC^ 
401- - . + ...- + -- -- -- -- -♦.-.. ----.4... ....... 4... 

GTACCAATGAGrnXirrcACXXTrAGGGGCXXTriTItTI^^ 

, cxTOCGCxxxnrwLOTtxjATitxrnn^ 

501.... + + + ^. 

GGAOXXXXXIAAOCTrAAGCTAAGGACAAACATrAACAGGAAAATTCrrcG^ 

GmMT6CGL\OnUTlT]mi^C(MGOOTAAT(^^ 
60h --- + + + + 

CAACTACGCnXIACrAAAACrACnxnXXXLmCCGAO^ 

TQiClCATGGTGATmnx:ACITCATAACCmTlTITCA^^ 
701 + + + .- + -, 

AGr[YUGrACCACrAAAGAGTGAACTATnx;AATAAAAAC]XKnXX^ 

GGATOTGCCATCCrATtKJAACIXXXTCGCTrcAGTITTXn^^ 
801 + + + + ^. 

CCTACAACXXTTAGGATACClTGACGGAOCCACnXjlAAAGAGGAACT^ 

AACGTCAAAGTAAACT ACGAGC TACreAAAAAffATC^^ 

TltniXIAATAAATCGAACTITIXKnGAGTrGAAGCATCAGATCACGC^ 
1001 + + + + .. 

AACAACTTATITAGCTrcAAAAOGACTCAACTrcCTACKX^GT^^ 
AACItXnxrAOCnACAACAAAGCnnX:ATCAA(XXrit^^ 

1101 + + + + ... + ... 

XreAOIAGGTGGATOrrcTITCGAGAGTAGTIXXXIACXXJAGGGAGr^^ 
END KAN CASSETTE j_^^p.5 BEGIN E. RAP 

CCTCACGAGGCAGACXTrcACTAGTTCCACTGAGOGTCAGACCOO^ 

1201 - - - + + • 

GAAGTGCIXXXTICroGAGTGATCAAGGTGAClCGCACTXnXKXKX:^ 

CTKXaAACAAAAAAACCAOCGCTACCAGOCK;il X ; iill/I ' l 

1301 ^» 4 + + . + 

GAACXnTIXnrmriXXrrGGCGATGGTOGCCACCAAAC^ 

ATACCAAATACrcTCCnCTAGTXTTAGCCXnVlGriTAGGC^ 

MOl -H + + + 

TAIYXnTrATGACAG(XM\AGATCAC\TCGGCATCAATCCGGT^^ 
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FIO. 13 fc> 



AAGATAAAAATATATCATC47CAACaATAAAACTCnCnXnTA(^^ 

* + ♦ ■•- + 100 

TICTATnTTATATACTAGlSynTGmTIT^ 

GAGGOCGCGATrAAATItX:AA!CATCGATCXnGAT^ 

+ + + -- + + 200 

.<nCCGGCXKTAATITAAC»TlCTACCrACGACTAAATAT^ 

GGGAACOCCCATGCGCCAGAGTlVmClGAAACATaXIAAAGCTAGC 

+ + + + + 300 

ACCCTTCGGGGTAOGOXnCICAACAAAGACTmnAaXnTI^^ 

TITATGCCrcTTCCGACXZATCAilGCATITrAT^^ 

+ + + + + 400 

ATACCKUGAAGGCTCKnAGlTlXnrAAAATAGGCATGAAGGACrACT^ 

AGAATATCCTGATICAGGTCAAAATATnTITCATGOGCTGGCAGTGCT 

Cj + + + • + + 500 

(\J TCTTATAGGACTAACIXXaiClTITATAACAACTACGC^ 

~ GCCTATITCGlOCGiClCAGGCXKAATCACG^^ . 

^ + 4- + + 600 

(XXXIATAAAGCAGAGCGACICOCXXTITAGrcCTrACTr^ 

TtnXX;AAAGAAATGCATAA3CrrriG<XATlXnCAOCG^ 

♦ + 4- ... + + 700 

X GACC CniCl ' liA bTrATTAGAAAAOGGrAAGAGlXXXXTAAGTCAGC 

ATAGGTIXnAT]X:ATGTI«GACGACriXXK3AATCGCAGAO^^ 

+ + + + — - + 800 

TTATCX:AACATAACTACAACCIGCTCAGCXnTAGOGT^^ 

GAAACGGCTrnrCAAAAATATOCTATIXZATAATCXnX;^ 

+ ♦ + + + 900 

■< TxnritKXXJAAAAAGTrrmTACXIATAACTATrAGCACTATACCT^ 

CCrrGTAACACKXXAGAGtXrrACGCSCACnGACXS^ 

+ + + + + 1000 

CCAACATTGrrcACXGTCranAATGCGACTGAACTOCX:cnXX 
CCGACJiACGCAGACCCTICCGiCGCAAAGCAAAAGri^^ 

+ + ♦ + +1100 

GGCTGTnx:GTCTGGCAAG(X:AC0GTIT(X7nTICA^ 

GCTCGATGATGCKXXXATIC^GGCXriGGTATGAC^ 

!, + + --- + + 13D0 

oiACCTACTACOCaK^^ 

GATCAAAGGA lC7 ' iCUlU AGj au:iJlllii C i t»CXXXaAATCTCC^ 

....«... + -•-.----• + -•»•-••-•♦"••------+ 1300 

CTAGmCCTAGA/i^ 

GAGCTACCAACTCTrrnaXSAAGCTAACTGGCTl^ 

.4.. . + -- ------- + -- •- -- -- • + •- •"• • - + M00 

CrcCATCGrrcAGAAAAAGGCmCAlTGAaXUAGTOtm:^^ 

'xCTCTGrACCAlOCOCCTACkTMCCICGC^^ ^ 

igagaoItoctogccg^^ 
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TOXnGCTXXXIACIXXXXUTAACICCKntT^^ 

1501 + + + + ..- + . 

ACCGACGAOGCriCACCGCrATrcAGCACAGAATGGCOCAAa^^ 

CACACAGCCCAGCnXXi^GaSAACGAWTACACCGAACTGAGATACCT 

IfiOl + + + ♦ + 

CrrUlGrCGGGICGAACCICGCnCCTGGATGTGCXTrGAC^ 

AGCTXrcOXnVUGCGGCAGGGTDCGAACAGGAGAGCGCAOtM 

17B1-- ♦ •*■ *- + ---. + • 

TXXrATAGGOCATlXXXXXnCOCAGCCITCnXXnxnXXK^ 

TCTx;ACTrcAGCX7rc(;ATiTTTGrrcATtxrKxnx:^ 

1801- + + + + + - 

AGACTGAAClYXXIAGCrAAAAACACTACXjAGCAGTCOCC^^ 

CCCrnrCCTCACATGTIXriTICCTtKXT^^ 

1901- + + + + - + - 

CGGAAAACGAGriX7rACAAGAAAGGA(:GCA\TAGGGGACrAAGACACCTATI 

ENDE.hep BEGIN M.rcp 

Itp.'' I.-.l.'^ 5 

ACOGAGOGCAA£XXX;iGaXXXXX:ACXX:GTGAGCrCAGCAGCrCOGTAACrr < 

2001 + -. •*■ + + + - - 

TGGCIXXXX7nXXXX:ACGOC<XXDGTGCCCACT0GGGTGGTC 

AGGGGItTAAGGOGGO(mnAOGGC(XXXAC4GCGGCrcTt^ ^ 

2101- + + + + li. 

TCXrCAGAlTCCGC0CCACATC<XCCCGCIX7IX:XX^ 

* TtxxxxnxxnxxocTCTocxntxnxnTcc^ x 

2»1 * * + + h- 

A<X:CCCACGAGpCGACiGCGACpiCAA(X7ra^^ ^ 

TCMJAGCTCXnXJItXXSACCArACACCGGTGATrAA ^ 
2301 + + ♦ + 

ACCTCX^AGCACAGOCIXXnATXntXXXACTAATTAGCACCACAT^^ J 

GrcCCTtXXAACCGAaUTCTXXTIXXJAfXK'.GAIXTACXXXTCAAAGCT^ b 
2A01 "* + + +. + . 5 

CGGOCAaXTIXXXnXKTAGAACX;AGCTCCa7rAGATGGaX7ITIXXX^^ 

A.\(:xnGCItXnXX7IXX;ACGTAGACCATCCAGACGCAGCGCTC^ 

2501 - + + + + .. + . 

llXKJACGACCAGCACCIXXATCTGGTAGGriCIXXXTrOGCGAGGCT^ W 
(XTAACXXXXAOGCACACGCACriXriXXXXIACTCAACXXXXXTfCT U 

2501 + + + --• + + X 

GGTKXrOGTODtnXTIXXXrrcACACCCGTCAGTIXXXKXX^ X 

ACXxxriTtxxxxxxxxxntx;A'iTxxx;Accxx:AGriTACJirAGG(:cit'Aix;A (yj 

27D1 + + + + + — - 

TCCGGAAGCCGCGCGGCVGCTACCGCIXXXrGTCAATGAGTCCX^GAGT^ 
CICTACACACTCAGCCACATCXIAGGOCGAGCTCGGCGC^^ ri 

2801 + - + + + + U 

GAGAlXTIXTrcAGTCGGTGTAGCTOCGGCIXXJAGarGaxrnxrr^^ — 

GGCGGAATKXTGCACTGTrcATTCCXITCAGGTTGTGGGCCTATC 

2901 + + + + + • 

<XGCCITAACGCGTCACAAGCTAAGGCAG;rCX:AACACCCXX;ATA 

CGCX;ATCrATXXXX;AGTr.CCAC:GCrx:GA\ACGOCGAAl'ntXX7IGCA\CX; 

3001 - + + + + - 

Gccx:rAGATACGGCix:ACG(rix',cGC<K:rrixKxxx:nAAAGGCAOG'rixxr 

AGCATITGGOGTIGGATCACAACXrAAGTCGaXIATnXXKKXKJACCG^ 

3101 + + + - + • + 

TCGTAAACrCCAACCrAGTGTXXnrCAGCXKXTAAAaXXKXnXX^ 



SUSStlTUTS SHEET 



wo 93/07897 



PCT/US92/09075 



< 



I- 



< 

01 

d 



GATAGTTACCGGATAAGGCXKAGaXTrcGGGClGAAOGGaXXnian^ 

+ + + + + 

CTATCAATCGCCTATlXXXXXriCGCCACCCCGA^^ 

GCCnYSAGCATTCAGAAAGCGCXAGCXTIXXXXMAGGGAGA^ 

+ + + + .-.«+ I7D0 

rcGCACTCGTAAClXnTIXXXXK7IXK:GAAGGGClltJLX;iU^ 
IXrAGGGGCAAAOXXnXXnVlTCTmTAGrnXKntXXX^ 

+ + + --- + + 1800 . 

GCrrcCCCCrrrGCGGACCATAr.AAATATCAGGACAGCCXIAAAGOGGTGG 

TGGAAAAACGCCAGCAACCCGGCCITTTTACGTTXXnXHXCTI^^ 

+ + + + + 1900 

.CCTirriXXXXnXDGTIXXrGCCGGAAAAATGOCAAGGACXXXUA^ 

CCGTATTACCGCCITGAGTCAGCrcATACXGCTOGCCGCACC 

+ + + + - + 2D00 

GGCATAA1XKXXX;AAAC1XL\CTC!GACTATCGCGAG(XXXX^^ 



lXXXXKXXnxn<riXXXnOCTAC<XG0CCATIX:AGGCGGCAGGGGGri^ 

+ + + + + 2100 

U. AGCCXXKXIACACACCGAGCATGGGOGCGTAACTrcOGCpGTCXa^^ 

GAAACXrrCCTCGAAACGACXKIATGTCTnCCIXXriGGI^^ 

X + + + - + + 2S» 

K CXriXXTAGGAGClTTGClXXXTTACACAAGGAGGACCAACCATGriCCAOCA 

+ + + + r + 2«) 

^ TtXXX:CCCTCACACGTCAACA<XXXACCOGGGAGT<XX:TITATA<^ 
O GCGTGAGCCAaJimrGACGiUTTIXUGCAGCTClt^^ 



< txx:ACTCGC7ixx:AGcxxx:TraTAAAcrccixx^ 

tTriXVrfXXXn-AGGCrGCXXXTrACATOGAGGCXIAAOCCAAC^ 
GCAr^GGGATCaXXXXXX:ATGTAGClCCGciTGC^ ^ 

•AGCGa:axxxx7ixxx:ATcaxnxxrc4ACGCGATCGTtxxx^ 

+ + + + + + 2(500 

A TCGCGGGCCCOCAGGGTAGGOGACGCCrrcOGCTAGCACXDOCnTArc 
X:cr,\ATAC)CXXXXXXXTAAOCCOCJCCXXrACAlXXXX 

+ + + + ---- +2300 

TGGCmTrx:GCCKXCCATnXXX:GAGCGTATGTACOGCX:<^ 
CC\AAA,\rcra5GCCACATXXXXrnXXUAACCGAATtXXr^^ 

+ -t- + + + 2800 

IXTrrmXXXXXXTGGTGTAGCGCACaiTITGCCTTAaXIAGCltLiGTn^ 
GCOGCGCItXX:GTCAGC\GACGACGTACAAAGCGGaa:GAax:a^ 

+ + + • + + 2900 

CGGCGCGACax:AGrrcGICTGGTGCATGlTlXXX:a;AGGCIXX:GGCX;/\^^ 
[j_ CCCTCATGCGGATCTACCTOCOGACmXIAACGrrGGACGGAC^ 

+ + + + + 3000 

GGGAGTACGCXTAGAlXX:ACGGCTGGGCXnitX::ACCKXXTGGACCOGGC 

ACGTCriGTOtXXXIACCGCTACCGGACAGaiAGGTOCXXXX^ 

+ + + + + --- + 3100 

ItXjlCACAGGGCXriXXXXJATGGOCltrroGCTCCACGCGCXX^^ 

CGTCXTTCTAOGAGGCX^lCACrcAGTGaXXXXAGTaXX^ 

+ :•-•• + + --- + +3200 

GACCACATCCTXXXXnGTCAGTCACGCGCGGTCAGCCGCTACAGOGa^ 

SUBSTITUTE SHEET 



3201 ■*■ + + + - - • ' 

ToocccarccnccctxxjmcccxxxjciccnxncAAC^^ 



OGGCTAC\GCGACGGCIACAACCGGCAGCOGACTGTCOGCAAAA^ 

3301- - - + + + •*■ + - 

CXXXUTGriCGCKXXXUTt7ITCCXXXnCGCCTXUC4GGO^^ 



CIXXTKXXXXnXXTTCGCGCAGGAAOGCAGCGAGTGGCK^^ 

3401 + + + + + 

CAGCAGGCCX«AGCACXXXXnXXTI«XnCGCrcACCGACX:GGCTC 



GGOOtX:AAAaXXX:AAACATITCGGGCTtX:ATCTGGACAC 

OCCXXXnriGOCCXnTIXTrAAAGaXXMCGTAGACXn^^ 
AAAGGCXTACAAOGSAAGCaUCAATOCAaXXrK^^ 

3601 + + + + 

TirCCGGGTGTTOCITCGGCTGTrAGGTGGCGACAAGATI^^ 

CAGGTAAAACItXnXKTTAGACGCTAGTrntriXMnTI^^ 

3701 ■♦■ + + 

GrrcCATirrcAGCAOCATCIXXXSATCAAAAGACCAAAOCCGG^^ 

GG<7ritTACGAATCntX7ItX;ATACCAAGOCATITC 

3801 + + + + ' 

CaiAAGATGCmGAACCAGCTATtXnrCXXn'AAAGGCGACTrATAW^ 

Mnllipic Cloning Site 

S B 
N Kg 
End M. rep o c pi 

t I n I 

I I II 

TKirAGrCTTGTGGIKXCATCCCmGCSXXXXXXXXX^ 

AACATCACAACACXIACCGTAGGCAaXXXXXXXMXMXrATGGriCT^ 

S 

Stop Condons p Begin Transcription Terminator 

3 Frames I 

I 

GACGTAGrrTAACTAGCGTACGATCXJATCGCCAGGCATCAAATAAAACG 
ClWZATCAATTGATDGCATGCTAGCIGACGGTCCGTAGTmTrnXX^ 

? f e FIO. I 

I c c 

1 I 1 

I I I 

CATCATCGCOGOGCrrGATCA 

4101 + +4120 

GTAGTACOGOaSOCACrAGT 



SUBSTITUTE SHEET 



W093/»7»97 PCr/lJS92/09075 

FIO. I^Bb> 

TOOCXXnCAGOCATGCATGGAGGCmXKnAT^ 

+ + + ..-- + 33QQ 

GGCGCAGTOGGTrACmACCIYXXTrAAGGATACIU^^ 

GCHGACXXXXXXXXMAGGGGCTCGAATCAOCGGACTA^ 

+ + + + - +3400 

GCACTXXXXXXXXXniaXCGAGCITAGrrGGCCroA^ 

TCCACXXXXXX;AACGCATOCGCGCCrATCAOGACXUaSAGGGOCAC^^ 

+ + + + + 3S0O 

ACX»IGC»G0GCTIXXX7rAGGCXXXX*ATAGriXXntXntXn^^ 



^ CrcGGCTATOGGGCCM\GGAAAGAGCGrnXXXX:AGAACAGGAAGCXXX^^ 

+ + + + +3600 

£i? CSAGCOGATAGCODGCraJITICKXXIAaKXXnxnTCT^^ 

CGAGCXXXntnX3GCCXXXX71'lir G lXXXXXX7r^ 

2 . + + + + '"Zl±ll +^ 

Li_ OXrrCGCXXZACAGCGCCCCCAAGGCAOCCCXXIAAGGCAAOGTnK^ 

X 

- G'lUiCGTK X XyitllTllXJIlXXXKXCGTriTGAATACCAGCCAG^ 

^ . + + + + tmi +3800 

^ CAGAGCAACXXIACAAAGCAAaKXKKKAAAACTTATCCritXXT^^ 

U 

< G<X;AGCTCACCGCCAGAATCGGTtX7ITGTGC7IXUTGTAan^ 

2 . + + + + .--.-.... + ««-^---+3g00 

XXrTXX;AGTGGCGGTCTTAGCCACCAACACCACTACATO::ACXX^ 

H 

I . 

E B P E n 

DXcN vPc d C S 

rbocm uso I 1 

R o H I tR I I 

I I V I I I II I II 

TAAATCTAGATATOCATGGATCCAGCTGCAGAATItXjAAGCTrAT^ 

. + + + + +4000 

ATmGATXn'ATAGGTACCTAGCnXX5ACXntnTAAGCTltX5AATAGCTA(^ 



AAAGGCTCAGTroAAAGACItXXX X ' l - Iia/l ' irrA lUWlUUi^i^ 

. + + + + + +4100 

TTCCGAGTCAGCrnxnXSACCCGGAAAGCAAAATAGACAACAAACAGGCCG 
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3 fM3 

xf^^ F- 1 O. 31 OL 

x> 

GTCGACCACCAAGGGCAOCATCnxnxXTIXKXKXACXXXrGTlt^^ 

* *■ + + •«- + 

CAGCIXXnXXTITCCOGTCKnViGAGACXJAACCCGGTGGGGCAACCGCCGTC 

GCGAGGGTlXX:GACCGCTGCAACTCCCXK;TGCAACCITGlX:CCGGTCrAT 

+ + + + + . 

CGCTCCXAAGGCTGGOGAOGTroAGGGCCACGriTGGAACAGGGCCAGATAA 

GCGCAGG0GGGGGGCICrAl'lUb"14'lUlCAGCATCGAAAGTAG<XAGATCA 

201 + + + + + 

CXXXnCCGCCCCCOGAGATAAGCAAACAGTCGTAGCTITCATCGGTCT^ 

TTGCAGACCXXnXXjAAAGAAAAATGGCCAGAGCGOGAAAACACCCrCrGA 

AACGTCTCGGG^CCTTra 

Nde I 

I 

40J TGGGTGrimGCX:GACCACATATGGG<XXK7ICAAGATAGGTTITTACCC3^ 

ACX:ACAGACGG<nX3GTGTATAC0CGGCCAGTIXnVVTCCAAAAAAT^ 

502 TTOAAGOCTGAGAGTroCACAGGAGTIXXAAOCOGGTAGOC 

+ + + + 

AACTIXXKJACnxnXXACGTGTCCTCAACGTIXXKXXlATC J) 

BanJII ^ 

AGCGCAGaXX^GGATCCAAGOCTCATAa;TCAACOCGCAGGACGGl ^(7ir,A O 

601 + --- + + + -.-+- t: 

tcgcgtcckxrctcoaggttcggactatckragtixkkkigtc 

Im V R 

Int startl -j- 

CGCGGGCGAGAAGCGGCTCATCGAGATGGAGACCTGGACCCClXrCACAGG h- 

701 + + + + -. + . > 

CGAGCGCCCGCTCTTCGOCGAGTAGCrCTACCIXriXKJACCrGGGGAGGlTTr > 
Ini LA GEKR L I EMETWT P PQ 

ACCrGGAAGTGGCnXXritX;AGCGCGACCTCGCAGACGGCACCAG0GAT(nX3 
801 + + + -- + + - p 

Int T R K W L V E R D L A D G T R D L 2 

CCCTCACAGAGATGACGCCAGCTCIXX7IXKXnGCGTXKnr,G^ 
901 + + + + + 

GCCAGTGTCnnViCTGOGGritX;AGACCACGCACGCACCA(XrGGCCCT^ 
Int VTEMTPA LVRAWWAGMG 

GGTGATGAACACAGCGGTCX^AGGACAAGCTGATOGCAGAGAACXrGGTGCOGG 

1001 + . + + + +-- 

CCACTACrnyroTCGCCACCItXnXTrrcGACTAGCGTC^^ 

X»t V M N T A V E D K LI A E N PGR 

Bglll 
I 

GAGGAGCTGGACATCGTCGCCGCIX3AGATCTT0GAGCACTACCGGATCGa^ 
1101 + + + + + 

CTCCTtDGACCIXTTAGCACCGGCGACTCTAGAAGCTCGTGATGGOCT 
iKjtEELDIVAAEIFEIF YRIAA 

TIXXXrCCAAGGACATCGTGGACGACGGCATGAOGATGAAGCTCaX^^ 

1331 + + + + , + . 

AAGCGGCGTTCCTGTAGCA(XTGCrGCCGTACTGCrACTlX:GAGG<XCA(^^ 
iMt RRKDIVDDGMTMKLRVR 
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no. 31 lo 

CAGCTCXX:iGAGAGax;iGMaSACAGGGOGAAC^^ 
GCnXXAGOGACIOOXXSICITOCIXnXD^ 

.GAGAAGTCACGlCGTOGAGCnAG\6c^ 

- ^ .4. + 4. ^. 300 

TCXXTACGCAAOGTlGGCGCmCGGGICCAGTOTClXZAGCC^^ 

xxac<K:jx:Gcr^ 400 

alt P core 

01CGGCrGCAlCCItnSIUU31GGA^ 

AGAGCCGAan!AGGAGAT3£Aa:iTIXnTr^ 
GAG4GGAGA€CI»iGOGGOIAO(?iaXXX;ATGC^^ 

ccixnx:cTcixx*A3x:AA(X!raxx:AG^ ^ 

PslI 

I 

GGTACTACGCGCKXIAGAaiTAOGACAACAAG^LIEG^ 
. *""""■'"" + ""**""""- + '------••- + -- -- -- -- • + ••»......+ 

^ CCATGATGax;GA<XnncnXMATGCrGTIX7ITCTA(^^ 

ro YY AL QT YD NKMDA EA W 

Inl sUrl? 

X ACCGGGCGAAGAAGGCAOCCGCCAGCGCCATCACGCTGGAGGAGTAC 

I- + + + + . + 800 

- CCTXXKX:CGCTrcTIO:GmXXXK7nXCGGTAGlXXX; 

^DRAKKAAASAIT LE E Y 

TACAGOGGGCACGOGGAGaxrGCATCrACCCGGTGCrAGGTGAAGrcG ««« 
X ---.»--.+---«--.-- + -«. + 

^ ATGTCGCOCGTtXXX:CimCXJGCGTAGATGGGCC^ 

< ySGIlAERRIYPVLGEVA 

2 GTAGGAAGCACCO(UCTXK:(XXX:aXX:ATtX:CTACAAOGTCTO 

ATCxnTxxnxxMcix^oiiGcxxicoGT^ 

RKIIPTA R RHA YNV L RA 

ATCGAGCAGAAGGCAGCCC^TGAGCXKXSAOGTAGAGGCGCTCAOGCCr 
-»- + + + + 1100 

TAGCrCGTCTIXX:GTCG<XTACIXX;CGCTGCATCTCCGCGAC^ 
I EQKAA DER DVE A LTP 

CATACATCCTtXXXrrCGA!CGAG<XTCOGCTTCGGAGAGCT^ 

+ + + + -.-- + 12D0 

GTATGTAGGAOrGCACCltXTOGGAGGCCAAGCCTCTCXJACTAGCn^ 
YILAWTSLRFGELIEL 

GCXGTCXXXXnTCmXXTKXXXSAACAAGATCG lWl 

♦ + + + 1300 

OGGCAOCGCGAAGGGCGCACCLr il X/ l l LM AGCAGCAACCGTIXKXXnT 
RGAS R VGNKIVVGN AK 
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GACrCltXXMTKXJAAGCGTCCTGTCACGGTrcCCXXnCACGTXXXX;^ 
1301 + ♦ + + 

CTGGCAGGCCAGCTrcGCAGGACACIXX:CAACXX:GGA(ritX'JVCX:<KX'I 
InlT VRSKR PVTV P PIIVA E 

GCATIXXrKX7roACX::AOGAOGCACKKX:AAC3CGGCI^ 
1401 + + + 

CGTAAGGACTACItXriXKnXXXTItXrGTTGGCCGACAGCTK^ 

IntAFLVTTTQGNRLSKSA 



GCATCCACGACCTCXXXXXntTrcGGCGCTACCnTCGCCGCIXliGG^ 
1501 + --- + + . + 

CGTAG<7IXKnXX;A(kx:GOGACAGa:GCGATGCAAGCCXX:GACnCCGT^ 
Ini IIIDLRAVGATFAAQA 

GGCGATGAAGTAOIAGATGGCGTCrrcAGGCOCGCGACGAGGCTATCGC 
1601 + + + + + 

CCGCTACrrcATGGTCTACCGCAGACrcCGGGCGCKKrrcCGATAGCG 
IntAMKYQMA SEA RDE AIa' 

CXrAAGGACACIGAGTXXTTAAAGAGGGGGGTmnTGTCAGTACGCGAA 
1701 + + + + 

GGGTrcCTCTGACTCAGGATITC7irxrca:AAAGAACAC^ 

PvuII 

I ^ 

GGCACCAGCaXrCCCGCCGCCAGGAGCATTGCCGTrcCCGCCAGCroA 
1801 + --. + + + 

^^cxTirxnixxxxxxxxxxxxxTitxnaTrAAOG^^ 



GGCGACTITCCGGCGAaxnXJAGGATGTOCATCACAGAGCCrcCGGGAC 
1901 + + + + 

COCCTGAAAGGCOGCTGaSACTCCrACAGCTAGTGlXnCGCAGGCOC^ 



2001 --- + +- + + ^ 

GAOC7roODCXi\GGaXIXlUGOGACIXnT^^ 
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ATGAirCX;AGCGCACATGAAGGACCG!IM3GAAGATGAAC4AGGGaXC^ 

+ ♦ ♦ + +1400 

CTACTAGGCTCGCGTGTACITCOltXXXmTICTACT^^ 

MI RAHMKDSTKMNKG P E 



- + + + + + + 1SX) 

CAAGTGGTTCAGCCACTlXrGCACCXIATOCCGTICTAG^^ 

PTKSLKRG YAKIGRPE LR 

5 GGTCXrGACGACCAAGGAGCTtUTGGODDCntnt^GGTCACA 

^ + + + + + +I«X) 

OrACGCIXXntXnTtXTOCUCTAGCXaSOCAGAGC^ 

^ GATTKELMARLGHTTPRM 
X TGAGGCGATC7IYX::AAGCT(XXX:AAQia^ 

- 'm:^'^ — ••«• -+1700 
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